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Pie BP TToOrts mwithin the Command and Ccutrol community. 
Vespite the penchant fom NEWeEr, better, and more 
Poems heated requircvment, there must exist somes meecure of 
EMigiases Gr the perscnnel reeded tc train with. cperate cn, 
aa tememrain thie readiness of, such -.eaulprert. PPrson wet 
consideraticrs cannct be> “tl vereed es or Chr m7 .eCSt Steerer s 
Testemedmro 1dertify optiral systems or eéEguipmrent. When 
these considerations areé carefully examrined, then the data 
Obtained from such pregrems can be effectively used to 
ScieemmersOnner subsyster ceésign and implerentetion. 

Reemerscnire! subsystem test pragrem is cne which places 
Paeeeeerri Site s@mphasis on sepeenien Cem aene (oda ne GuleT ene. 
Movermeunet. 1] Pnurerates six cbhjectives necessary for a 


Euie@ecessaul test program. 


2 


mommeValUate whether trem system can be cperated, 
Gemovained and controlled ty the trerscnnel assigned to 


a. 


Pemeebomaetvertine the efrect of hufrer perfermance on system 
EMmeormadace and vice versa. This cbjective is aived 


Pramdiasccovering critical inedequacies in wman-machine 


14 











BOeLETSCTION and stbs@€quently identify; changes that 


would simpreve their compatibility. 


=. me wdevelop valid qualitative end Gud, La tive 
peugsenmne | requirements, selection precedvres, ara 
tatles of organizational manning. Fow many and what 
tiyeece OC. Pecpit® will provide optimal etfectivenes¢ cr 


the man-nechine interface? 


Selo @Valuatewindividual and/or lenge term operaticnai 


readiness and applicable treining programs. 


Se mom evaluate jtraining e€quiprent and SUPOCr tue 


Mmeterials. 


Colecwevaluate job dids, technical publications and other 
morse == fer training and for aessisting cn tre Feo 


pertormance. 


Mien edsec Ercductivity threugh autemation involves twee 


J 


major issves; techrologicel and human. Speech ‘5s 4 uniquely 


at 


PumawmGameblllity. Speech recceniticn ty @ computer involves 
Betuwemer a Machine tO eaCCEpt, reccgnizée, and correctly 
mMeSDOnd) to SLOKeGer resseges. This machine must take the 
Pctemcr, COmrer® It eea4inst the expected pronunciation 
DOuciemiwdol@® UtLT@rances, id@ntity tih@ intenced wessage or 
imuenemeenmdnd produce the correct dnd apprcpriate resporse. 
mites ace ly implement the capabilities or such a 


technology, [remmeecuyectivesw=anove beceme ally the mere 








mem eweant. Cf crararount importance’ is the human, for it 
takes people to make all this automation work. 

Speech reccgnizers ccmmeéercially aveilable tcdey are 
erfective only within narrow lirits. They heave relatively 
small vocabularies and “frequently” contuse werds. Within 
this context, it becomes incumbent uron the user to cevelop 
the skill tec talk to the recognizer (Ref. ¢f: Dee cewe As 
such, a reccenizer’s performance wiitl very widely from 
Spedmem to Speaker. 

men of the work in speech recognition has centered cz 


the Severorpment and improvement cr speech recognition 


gevices. tor example: 


=- Jinear Predictive Coding (IPC) tin early “74s 
Seeryodenmic programming 


aes oeveloprent of 1 milliozs tit/ste processors 


A user’s exverience nectwitkstanding, the human variable in 
Pecoewemon perferrarce remains streng. Unis has often been 
Cosemveawern the past end even led tc a deseripticrn cf user 
Gavecories (Ref. 2: p. 2] of “sheers and ‘goats’. These 
speech recogniticn systems work well Yor the “sheep” tut the 
MemOmme, Of) the problers ere creeted by e srall Segment of 
Poeweooulation - th® ‘goats’. 

Recognizing the significant impact that engineers Eave 
Dad wome percvetuatirg the continued advent and technological 


aavdme@ement of speech reccenitiocn, it is nevertheless, 








critical to remina ourselves of the interdisciplinary netvure 
St, speech recognition. Besig@es engine@ring, the tctal 
discipline cf speech sciences and technology inclvies such 
Dncddwapmoneal diccitlines as psichology, linguistics, anatomy 
ea, eysiclcsy, cemputer sciences end humen factcrs. This 
Picco cudeavors tO examine the impact of human factors on 
maceouecesstul recceniticn of speech, prircipally addressing 
Demat eNCES Or variability arong wsers. 

Piret, the modality cf voice input will te examined 
citing <scre of the more wvéadily apperent edventages and 
misadwaentages, and an cvervieyw rrovided as to its poteatial 
apovlicability in a Command and Control @nvironment. With 3 
general arpreciaticn ct sceeci reccgnition (the terr ‘voice 
reccgniticn is synonomrsus ind used interchangeably within 
this document) in hend, the variety of khuran factors that 
Cotmummmat the successful recognition of spe2ch by a rachine 
wibbetaenm be summarized. SoS g Ue Vicon SOT Lime Gat 
rethodology used me C€Xaminae sand difierentiate  speeck 
mecOpmmerron equitréent users will be presented. Lastly, the 
SkDEnmmental r@sults will be presented énd en analysis 
Proverded of the correlation of Gach variable examined to its 


associated errcer rates as well as ar analysis c? variance. 








TI. CGMPUTER RECCGNITICN CF SPEECH 
— ee ee ee ee Ee Le eee eee 


A. OVERVIEW CF VOICE INPUT TECHNCLOGY 


Speech recognition can be consiaeread as € svoset of a 


broaaeér Pred known as Speech Understeéending. DEeCeen 
Understénding Systems (SuUSs have the eniee time of 


mrverpreting the intent ofr the streaker whether or not the 
user's speech is grammatically ccrrect or well formed. 
Bhile Speech Recognition Systems (SRS) are primarily 
meee reseed in te correct recceniticn cf every word, SUS are 
concerned with the meaning of entire conversational 
s@emernts. 

Until now the only significant undertexing has been the 
BreeeeeR croject {Ret. J], a five y@ar effort with the 
CoyeCu yet OF COtCal Irn s a creakthrecveh aie speech 
understanding Capability that wou la Prien) a Ieows —tre 
eve meerent OF LEractical ran-rachine communication systers. 
pecumeeecily, the cbjectives were tc develco eae 505 that 
movutumemasccert continuous speech ftror rany cooperative 
speakers of a gereral Ameérican publics a system which used 
Symuectrec analysis, semantics, pragmatic information and 
Pmecomres FO acavire an avprepriate ccrputer respcecnse. 

ieewzoars Of screech recognition, in contrast, ére i1€ss 
ambitious. Instead cf abstract concents such aS meaning cr 


Pogtemerencing, SRS try to solve the rere practicéel tproblers 


1& 











Of aMadlyzing the acoustic wavetorr and applying ratters 
recegniticn techniques in order tc ¢ifferentiate between 
utterances (Ref. 4]. Figure 1 illustrates a typicel sveech 
recegnition model. 

Wemacoustic Speech signel is first analyzed to extrect 
SuCh ~acoOvstic Larareters as frequency spectrum and the 
Swersyaein different time segments. NEXT, information 
Cant yaeaeer@atures ere extracted that cefine variots rhoneéetic 
events srvch as how neisy (fricative-like) the cignal is, 
Positwogs scr different vyowel-like sounds and vibratior of 
oie Smieaner S vocal cords. This infermration is then used te 
divide the syeech into tire slices or segments and are 
weve lied with ~Tnonetic categories. tne fphenetic sequence 
fOr Geese Sineut speech is matched to stored seutences c? 
GmeeCmea fronunciations for the words in the 1eéxicon cr 
Mich MOmetm, 20d the peSt hialtching sequences are determined 
Pompemete=trest likely word(s} that mad cccurred in speech. 

see cn reccenition systers Can be consice rec as 
pelenging tc cne of twe categories; ccecntinucrs (connected) 
Or isolated (discrete) speech systers. Continuous svsters 
een meme se wren can e@xUract infcrmaticr from strirnes of 
words even thoveh the words run tegethér as in natrrel 
Speecn. Ibsormaved SYSTEMS require a snort Sause tefcre end 
after utterences that are tc be recogrized as entities. The 
Giaimem duration of a patse is tyzically between 14¢-22¢ 


moec.mean iscleted werd recognizer is also limrited in the 


oo 
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duration of the spoken utterance, usually 2-4 seconds. 
Continuous speech recognizers are just now beginning to 
appear cn the market but are expensive and their 
GCapabivities and reliability have yet to be realistically or 
preactaeally @valuaveéd. For the rerainder of this thesis our 
discussion will be confined to discrete reccgniticn systems. 

Twemother concepts of speé€ch recognition to te discussed 
are that cf speaker independence and veccabdtlary size. 
Speeker dependent systems are those which require speaker 
adaptaticen (cr “training’) in erder to echieve reccenition. 
This is in contrast to speaker inaepend@ént systems which 
Mel) We@ecenize speech réegardiess of the speaker. in terms 
Of” Smerma recognition e€qvuiprenrt end tise ir essociated 
VOCaoWHaries , rest recognizer. work well with smrail 
voceoulsries of 18-50 words [Ref. §&: oe EG]. The 
pPoSctmmaty of confusion tetween words increases as the 
vocebuléry size increases, and to some extent the chance of 
sSirilar sounding words increases Vita such larger 
vocabularies. 

Avot S juncture ait iS appropriete to expand our 
definition of “words” to G@ncomrass more than just individual 
words. As used herein, ‘word’ it used interchangeably with 
the term “utterance” énd may be e€ither a Singular mono- or 
PolvVevwmeubeabic word or a combination of mono- or polysyllabdle 


words joined into a phrase. (ie. Flace-a-Circle-on-Moscow}) 





Tmeetecur processirg functions [Fe?. 6] contained i. a 


limited vocebulery voice récognition systemr, as shown in 


Beueuteee, CONSiSt of a tYransduc@®r, preprocessor, featrvre 


extractor, and a final aecision-level classifier. 


Le, 


nN) 


Transducer: The micrcphone is the interface between 
the user and the syster and converts the spoken phrase 
moo E€L@ctrical signals that aré analyzed ty the othé@r 


Gempcnents or the system. 


Eepeprocessor: NCCU l Te mOW tor 1S | Terresen ted, 
meee@urdl iInrtormaticn must be E€xplicitlily or irpiicitly 
Gomteained in all speech Exscodires. The 1 eta 
analyses precduce parametric representaticns [Ref. 7! 
and take place in the pretrocessor. This segment of 
meme sSyStem trenstorms the speech signal in order ts 
enhance certain properties and make ther more easily 
SaeeGeracle if a spem@ch recognition systen. The signal 
TS nOrmaliz@éad in time “Sy dytnaric prograrming for 
SUBSEQUENT comparisons Woo00 yarious Reterence 
BauLerns. Lata Compression removes any extraneous cr 
irrelevant intorme tion. EOoth tine and freqvency 
Gomdam atdlytical té@échnicues are perzormed or the 
imp signal. Seeecn enalvysis 1s echieved by either 
direct analog spectrum analysis via fast fourier 
transform (EFT) in the frequency demain, or linear 


predictive coding (LPC) in the time domain. 


N) 
nN) 
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Figure 2. Processing Functions of a Speech 
Reccgnitiorn System (Fror Reterence €} 
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Beature Extraction: The key processing function in a 
Fatteérn recognition system is the feature Extractor. 
The more optimal the set cf acoustical features 
extract€d and s@nt to the classifier, the less complex 
the classifier need be to achieve a given accuracy 
level. This segment of the system troduces a set 
hnumter cf significant acoustical features (depending 
on the individual recognizer) a few of which include 
epecural Siopes, phonetic cldssificaticn, and initial 


estimate of word boundary. 


weetocicter:  <Inh® classitication process 18 perrorned 
in software using a mrinicomrputer. When @ speaker 
issues az utterance, the encoced ffatures and their 
Mme. Of Occurrence are stored in short term remory. 
The duration of the utterance is broken into time 
Segments and the features reconstructed into the 
normalized time base. Reference fatterns, rreviously 
input by the speaker tor the syster’s vocabulary of 
memos are cOmpared’ to the feature cccurrence patterns 
and a ‘best-fit’ or “closest-ratch” determined for a 
Word decision. The number of bits of inforration for 
the Reature map Or eecadci ee ne tecrence ~~ paltern. 1s 
determined by mapping the number of acoustic features 


MmLoetLne csumrb@r of time segments. 


a4 





The first two processing functions are accomplished by a4 
hard wired Preprocessor ~and “feature <extractor. The s 
echreves real-time precessing since only the classification 
function is yperformed in a general-purpose minicorrputer 
eer e> ¢. 177). 

A discrete word recognizer must be ‘trained’ Loe 
individual talkers and/or words. This can be dene by a user 
Simply speaking a set number of training samples into the 
Mevic@e tO pErcvide a reference set or fe@etures. The system 
EeOoleSs ue memory the reference set cf word features for Eeech 
word (utterance) the user has spoken. Cnce the syster is 
Poainea, whe USe@r may speak werds intc the device during 
normal operation and these are compared with the stored 
patterns. The ‘closest fit’ is selected as the recognized 
word . This sequence of events is ccmmoenly pertitioned inte 
Poe. trampling wod recognition moaes of crperation. 

TWeee are two types of Errors that can Cccur in speech 
recocmurion. dine fipeteis G@arejyection, or the inability of 
tre recognizer to corr@ctly classify en utterance. ae 
Seco@amye a7d in 6&6 practical sense rore troublescme, 315 @ 
Pisr@@e2zdition. This occurs when the recogrizer classifies 
an utterance as something other than what was spoken. 
Better recognizers usually have recognition algorithms 
designed to reject rather than guess at questionable words. 
Higher quality systems such as Threshold (Models €EZ¢€ and 


689) have error rates that are quite acceptable [Ref. 8, GY, 





Oa Wexetensive experimentation has shewn arprorimate error 


rates to te between .2 and 11.4 percent [Ret. €: pp. 179 
180). Of course, what constitutes az acceptable errer rate 
is critically dependent upon the perticular application and 


data entry rate. 


B. THE VALUE CF SPEECH RECOGNITION 

The Tepartment oft Defense has teen very active iti the 
past few years in their efforts to assess the méritis of 
Voice reccgniticn with machines. Such lecaticns as the 
Naval Postgraduate School, Wright Pattersor Air Force Base, 
Rome Ayr Development Center, Naval dir Develcpment Center 
and assorted cther ggencieés ana cortrectors, hate conducted 
Sr Leteeve TEStS in order. to Examine huram intéracticn with 
machines throveh the us€é Of voice intut and other 
Todalities. In order to comprehend the need for further 
research Petia ine ton VOLcCe inpur tCechnolosy, =<2t 15 
essential to review the advantages and lirlitations taat this 
type Or Technology \sorters : Were’ ip pertattyys it? is 
Esceimmae tO understand its rotential capahilities and 
applications in a military environment. PSmes ve cen 
recognition beneficial (consiaering costs ot Soo = 
eeGyeeot), practical, .ancd usable te justify the continued 
expenditures of reseerch and aevelormrent funds (€.1 and €.4) 


and operational ronies. 


Zc 





1. ‘Advantages of Speech Recognition 
Proponents of computer recczrition of speech will 
continually extol the virtues and unlimited peseibilities 
the technology offers. In an abbreviated fashicn, the five 
general advantages of voice input to machines may  o.be 


Surmarized as follows: 


== Natural communication 

== 1 relning 

meu lvinodal communication 
== Rast comrmunicaticn 


Samrrror reduction in data incut 


meeecn iS cur mcst natural meade ct communication. 
It is a familiar, sSpontaneovrs ana convenient rethod of 
expressing one’s thoughts, ideas, cor intentions. Untrained 
Peer Sao: S.VO1ce Tecozgni tionssystens, resardless of whether 
miley @dn reda, Write, type cr keypunch, qar all speak cr 
Make sounds. these Character! Si16€30 01 § the speech incut 
focal Muymmaxe it applicable for users at all general skill 
Level 2Om Systems Engineers tO cemmuter operators to blue 
COtlam@ewOrk®rs on an assertliy iine. 

A user cf spne@ch recognition equipment requires 
ivi veweor no training. They have only to restrict their 
Spoken utterances to those which the machine can reéeccgnize. 
In mye case of discrete systers, isolsted words aré 


S@parated ty a short pause so as to ease the location of 





wOrd boundaries anda word choices to which the machine has 


De@n tralinéa to recognizé. Although this appears t9 te 
disedvantageous, it is more reslistically a compromise tc 
nMatural sreech in that na adverse affects are caused the 
user in terms cf operating the speech recognition equinment. 

Meeperimentation |Ref. 11: >. €@8) has shown that 
speech, instead of interrupting communicaticns necessary tec 


perform other taskt, can enable users TO do these task 


in 


Simultaneously with voice and therety reduce or at a 
minimum, not add te the time required to perform a complex 
task. The advartage of having one’s hands and eyes free to 
mo MG@tmet tasks if perhaps the pivotal point iz the 
adetermination of epp.icadility Of speeck ré€coOgnition 
aq@vices. This rultiwoaa. aspect allows us to flace the 
microphone anywhere (headset mounted, hand-held, on a stard) 
ang still cormunicste commands and irformration. Thresnold 
Technology even has a wireless micrcphcrne [Ref. 12] that 
Ben eomererens) ve NOOLLity while talking to computers. 

The fastest modaiity for communicaticns by a hurar 
1s spe@ech. An individtal can speak twice as fast as the 
ewerepe tyrist cen type (Retr. &: ry. 45]. This has been 
clearly demonstrated by Cckman and Chapanis [Ref. 11] whose 
experimental results showed that cormunication via 
typewriter or handwriting could not epproach speech in terms 
of sp@e@d or task erficiency. Further substantiation fror 


the Navel Postgradtate School [Ref. 8: p. Z] showed that 


ee 








moiee Bntry was 17% faster than tyring, after only three 
hours or training. Additionally, while speech recoreniticn 
Mecuracm 15 Slightly degraded ty mental or motor loading of 
the user [{Ret. 13: p. 32], voice is nevertheless ‘aster and 
rore accurate than other inptt modes when the user must 
perform another task while Simultaneously interacting with 
the speech reccgrition equipment [Pef. &: p. 2] 

fPaewm CeiSecle@ar thal Speech recczcition Lrermits 
meepa @Ntry directly into the ccrputer without intermediate 
Steps suck aS Wanual transcripticn or keypunchine which aré 
Subject to errcer. Again, resé€arch at the Naval Postgraduate 
School has shown that 165% rore errors occurred in manval 
Hid Mmanmicuiation (typing) than ty voice ([Ref. & p. 2]. 
Buren COmmon entry errors aS the transposition cf digits, 
mhich are usually Caused ty eye wovernent or other 
distractions, are almost eliminated with the use end 
automatic speech recognition [Ref. 14]. 

Zoe mir cations of Speech Recognition 

merce particular technology was dewoid of errors cr 
moeachmeal limitations, we could assure universal application 
and implementation. Although the advantages cf speech 
PecCcgehutLon ar@ se@mingzly well @statlished, there do exist 


several preblems associated with the ability to speak ito 


mecnines:. These liwitations include: 


~- User variability 


Sew COnStre@inéd speech 


ao 





—— Isolated spe@ch 
-—= Breath noise 
=e User confusion 


~~ Environmental factors 


Speakers exhibdit & wide range Ot personad 
Smeracteristics that add a signiticant measure of difficulty 
in the ability cf a machine tc recognize speech. A 
speaker's Sex, geographic Cc ane and articulation 
experi@nc® are just a few of the elererts that result in ia 
user’s variability. Consistency is also a key e€lerent in 
successful reccgnition accuracy. A speaker may talk gqrite 
differently in training the machine as compared to when he 
Peectemiicvesuse it in a practicel arriicetios. Additionally, 
Physicel changes in the speaker such as age, physical 
Coudition, stress (Ehysical or emotional), or fatigue, to 
mame a few, cén induce variability that will ultimately 
epemecresucc@SsSitl recognition accuracy. 

Meeetcelated werd reccenitior system imposes a 
restricted (constrained) vocabulary both in terms of size 
poe CGComeveot, upon the user. This tecomres a Limitation when 
WE consider that most people aré accustomed tos Speaking in 
DeaGutceeer went prove. Because of the limited vocatulary, 
Moercomemuse oe Careful eof the types of werds included fer 
Ree locmmeutonm. In@ similarity of s0und Structures between 
words (ie. Nine vs. Time) adds a teasure of ccnfusicn that 


Gam Sups@quently affect overall rperformance. Tesign of 





eepoCamuleary tcr a particuler application is an imvoertant 
and controllatle factor in determining the acceptability of 


oe “eo 


MOice@@aemrut for a given task. 
Recause isclated werd recognizers depend 
Significantly upon the detection of a minimum peuse between 


words, wOrd boundary detection becomes trerhars the sing 


}—— 
ay 


Mest weritical JLlimitatior. The usval methcd 1s to meesune 
changes in energy levels [Ref. §&]. An isolated wora i?% 
detected at a pcecint where the enerey in the accustic signa! 
meceemauome @ CETtein threshold. At the end of tke word, 
ies eoerTeye drors, and the resultant silénce indicates that 
GoewUaerance iS Over. But, energy fluctuations are nor 
PmoUPmeeero detect @ikt word bdundaries, and taus adveuced 
aqetection techniques will have to involve detection ena 
mel W@erson Cl StOL consonants witain words, while eliminatinz 
pauses dve tc “lip-smacks” cr breath noise. 

Mmied Linited vocebulary, isolated word recognitisn 
system, breath ncise can be a seriaus problem [Re?. 4: fr. 
174J]. An inaiviaual who is inveclved ir littie or no 
physical moverent while engaged with @ vecice recceuition 
System cen achieve very high recogrition accuracy. cies 
seecubaey cdi ScOn deteriorate once the user begins tc move 
around. Inhaling will not cause any eédverse affects when 
wsSing a closé-talking, nois@=cancelling microphone, but 
exhaling will preduce signal lievels comparebdle te speech 


levels. As rhysical activity increases so does one’s 


Ol 





breathing vattern and as a result increased erxrhalaticn vill 
lead to the above mentioned deterioration in recognition 
acctracy. 

far Je Vic ice Po wt ERevides multimodal 
communications, thiS particular advantage hes an inherent 
limitation in that the user can become contuséd as to what 


mode toc wu 


in 


e. AS a result, inout modalities can beccme 
eonutused,nand interfere with each other sc tbhat tre total 
rate of information transfer may net be as higtk as the sum 
Of the rates pessible with each Seperete mrodelity. 

Moaliy, tne e@nvironrent i23 which whe sz,eech 
reccgnition deévice is placed may héeve en inédvertent affect 
Sur eeeenition accuracy. For examrrle, speech retognition in 
en aircreft cockpit may be desradé€d due to engine neise or 
eomt | wevineg WOlce Cranating Vie emer cd tt radio 
eormunmications. Orr; Conse er the placement of <¢uch 
Pechnoloey in a crowded Military Command Center where its 
memidtaeenuy can be affected by backzgrcund noise from ether 


members located in the nearby work Ssface. 


Pee aeePO Ao ILITY OF COMPUTER RECOGNITION OF SPERCH 
wee commercial Applications 
mremranrst vVOlece input syster7s to be used by industry 
were installed in late 1972 and early 1973 [Ref 1&]. These 


CaTly apelications included: 


per cdidivtyecomtrol.and inspection 





Sa, cUtOmamed material handling 


Bema eCl VOICE invut to corputers 


Their successful implementation was due ir large part te 
recognition accuracies that were greater than or e€qual to 
the manual keying accuracies ottained trom the came 
personnel. 

Meamost gqiality control and insrpecticn zrocesses the 
inspector s hands eand/cr eves are SCC) Chanelle 1s Deer Ech 
mek. meorough th® ws@ ot a voice recognition system it is 
possible tc combine the insvectcr’s normal work requirements 
Mith the SsSinmulteneovs entry of asll data meéesured and 
observed. Pious aeideds COPpOrdatlon imstvalied voice déta 
mur y Equipment in early 1Y75 for the inspecticn of cclor 
t@levision tacerlates. Here was an apriication where the 
inspector had to manipulate, orient, and measure parameters 
using gauges and meters . The requirement to simultaneously 
record the measurement data also existed. In this example 
the operator was atle to echieve both tasks at once [Ref. 6: 
mp. 122-1835]. 

Voice entry has been utilized in recent yeers to 
eou tren the roverent Of Materials suck as percels, 
containers, béggazge etc. throvgh distributicn and sorting 
Centers. A voice Control, Ceommececrace TOUTINS systven 
PoStaiewean Dy SS Kresge in Nevember i974 allowed just ocne 
operator to, handle each item, read the label, and speak the 


@estimecrion code fcr each carton intc his/her micrcephone. 





Formerly this had be@n an operétion that required two 


rersons and still resulted in the “burching’ up of different 
2 2 packages. Fellowing tne Mrastaltlavvonw ct voice 
activated scrting e@quipment, the tunching probvler was 
eliminated, productivity increased, and serting errors 
meduceas ikef. €: Lr. 1E5] 
2litery Atrplicatiors 

These applications mey be ctlaced in the general 
Seevegommes Of, Equipment and frocess control, field date 
entry, asta management, and ccoperétive man- machine tasks. 
SecOtmemeeetiritive classification was proposed vy EHeék et. 
ee inwesr? (Wef. 16] tec include the general ereas of 
BeEcurity, Ccmmrand and Control, Data Transmission and 
Communication and Processing Distortea Speech. Lee ae 
Peomeees dad recapitulation cot rilitary tasks that could be 
eens cered for speech recognition techrolcgy. 

Oje pelrticular interest is the use of speech 
recognition ftcr Command and Control applications. The term 
CS, command, Control, and Communications, reférs tc an 


overall system comprised as a minimum of these key elements. 


as Gemmand Authority: The commander provid#es tre central 
aeipmontuy, Unity of purpose, and the overall concept 
goto how cperations will be conducted to accomplish 


mission objectives. 
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TABLE I 


MILITARY AFPLICATIONS FOR SPEECH RECOGNITION 
(From Reterence 1€) 
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I .@eleurRIiTy 


Speaker Verificatico (authentication) 

Speaker Identification (recognition) 
Determination of eEmoticnal etfects (ie. stress) 
Reccgnition of spoken codes 

Secure access voice idenstification 

Surveiliance otf communication chanzrels 


bef FA td 


II. COMMAND AND CONTROL 
A. System control (ships, aircrart, situaticn 
displays, etc.) 
Peme voice Operatéd computer input/output 
5. Lata tendling and record cortrol 
LT. Material handling (rail, baggage, publications) 
E. Rerote control (hazardous materials) 
Pooeaadm@inistrative record centre! 
[ils e tA TRANSMISSION AND COMMUNICATION 
A. Speech synthesis 
Bee vocoder systems 
Come bandwidth reducticn 
LT. Ciphering/coding/scramrbling 
IV. PROCESSING DISTORTED SPEECH 


Diver speech 

Astronaut communication 
Unaerwater telephone 
Oxygen mask sreech 

High “G’ ferce speech 
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8. Organization: This eElemént provides the pathways 


through which the plans, priorities, and directives of 
the commander are provided to the force ané@ throtgh 
which information pertaining to the forces can be 
Proviged the centrel authority. These pethways are 
found at ®@ach echelon in the fcrmr cf ccmmand pests, 
op@rations céntérs, or command centers. 
oe Cormunicaticns: This provides alle means fOr 
transmitting plans, priorities, and orders to elemerts 
of the force and the means by which the forces fey 
inform the Commander cf their activities and needs. 
deeeeemtermation: A key element that facilitates ccentrol 
by confronting the Commander with On Vee Ub det 
Pmeotmatton required tO support the decision-mariae 
process. iNscrmation supports Heth. the stass 
planning and command decision-making process at ail 
Tevels. 
The command centers that will provide the reguisite 
organizational framework, srertorm several vital functicas 
for the Commander. First, is the capability te communicate 
Pecuremy, ema preferably ty voice over a wide choice of 
erecumrse. se@ecndly, G€ach command center has the task cf 
integratine inforration which comes fror its sSuprorting 
Silememrs. §“S third capability rtrevided ty these centers 15 
the processing and display of information. Wee: Clr ta 


tunction, associated with number three, is tne qvrick end 





accurate dissemination of information, Teports, ad 
directives for the Commanaer. 

Wevare Particukerly interested f@ the “unetioa cf 
information processing and dissemination as it provides é 
wuicdnee deelication Yor computer récognition of speech. 
Command center automation, resultive in@more. cefficrent 
Pommuneseaolons, Will lead to increased troaquctivity. Ia its 
broadest sense, communication is the management of 
meeorgmrion, dnd luformation, not paper, is the chief 
Deoduct of the command center. Cur C3 systems that ere 
designed end fielded for these Centers, and speech 
Reccegnition as “a cCcompoment of Such, can pravide. our 
Commanders the capability to observe’, “decide, “act, and 
“react with speed, decisiveness ena accuracy. 

Navy feasibility studies sponsored by Naval 
eeectromrces Command and conducted by Dr G.k. Peock of tae 
Naval Postgraduate School, Cxamined the potential tor voice 
dave €utry fcr Command, Control, and Communications. Two 
voice recognition systems were installed in 1986 at Fleet 
Headquarters, Commander-~in-Chief Pacific (CINCPaCFLT) in 
Hawaii to e@€xamine the benefits anda tiiritations of voice 
input for cperation ct the Worldwide Military Command and 
Control Time-Sharing System (WWMCCS TSS) and tke Ccean 


Surveillance Intelligence System (CSIS) Ret. 17: ns ell. 





Poock has also demonstrated that using voice inrut 
to exercise a typical scenario on the ARPANIT, an 
experimental network since 19€Y employing packet switching 
technology and ccnnecting over 15@ host computers, was 
Significantly faster and meré accurat® than entéring the 
commands manually [Ret &]. Twenty-four subjects fcllowed a 
fixed scenario of instructions wrere they accessed the 
ARPANET, logged into different host ccrputers, read 
mecc@e2e>, Sent messeees, read files, transferred files 
petwe@n host comrputers, deleted files and interconnected 
host computers. Simulated commana centers operating on this 


network include the Navel Fostgradvate School (Monterey, 


= 


Calitornia), Naval Ocean Systems Center (San Dies, 
California) and CINCPACFLT (Hawaii). 

mubgomatic Spee€ch recogniticn has also teen found te 
heve considerable potential for imagery interpretation and 
moveilieence report generation [Rerf. i7: t. 49]. A 
Significant amount of reéesé€erch has been performed for the 
Defense Matring Agency (DMA) for such applications as voice 
data entry for tke precessing of Digital Landmass Syster 
Satovmedata, pretar@tion of Flight Information Putlication 
(FLIP) data énd ocean-depth meesurerents for digitized 
corvormmeameniCGerplications. In all these applications tHe 
environment is such that the operetor’s hands are busy and 


Preouegmtily involve the use ot stereo optics and other 


Suecian | deyices. Voice has been shcwn experimentally to be 





mastery E€aSier, anc a less fatiquing rode of data ertry thas 
Bistorically fore conventional means [Ref. 17: p. 27]. More 
recently, tne feasibility and advantages of voice input 
technology were descrined Yor use in the CCINS Network 
GControw Genter (CNCC). The Cormunity On Line Intelligence 
system Pere rconnects on-line joformation storage and 
retrieval systems iccated at a number of locations witnin 


the United States intelligence community [Ref. 18]. 





JIT. HUMAN FACTORS IN SPEECH RECCGNITION 
ee ee ee ee 


wee DN TTOM AND PURPOSE 

Himen factcrs is concerned with impreoving tne 
productivity cf the user by taking into account human 
Charact@ristics in the design of a system. As descrived by 
Huchingson [Ref. 19: p. 4], 

The term human tactcrs is more comprehensive, covering 
ail Silomedical and psychosocial considerations applying 
tO man in “he syster. it wiacluces noe only hiumen 
engineering, but also life suptort, personnel selecticn 
and trelnirg, training eqtipment, job performance aids, 
and pertormance weasurerent and €valuation. 
moempeegmre rererrec to in this definition ere tnose vho 
PmapicameyeOperete, maintain or service the syster. They are 
those who will interact with the system’s design. When the 
focus is on a broader interpretation it’s appropriate to 
speak of a Human Factors Subsystem or Persornel Sunsystem és 
was d@scribed earlier. 

Human factcrs engineering deals principally with the 
many factors involved in the aesign of a2 new system - from 
Paecdwem= CO personnel. For ovr efforts in this analysis, 
tne current technology has been determined to be acceptatle 
and, experimentally as well as operationally relistle for 
his use "in a Command and Contrcl envirenment. New, user 


Variability is to be investigatea further in terms of now it 


maerectuss reccgenition accuracy. 
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Since energy ina sp@ech signal is usually displéyea in 
Reems Set ftrequency, intensity and time, it world seem 
plausible that each werd should have a unique acoustic wave 
pattern and, if $0, word recognition would be a sirrle 
matter of the voice recogniticn system scanning the pattern, 
comparing the sinple pattern with a data bank of reference 
word patterns, and oe which word was spoken. 
Unfortunately, human variability messes up this uniquely 
Simplistic approach. Our purpose then is to discuss the 
huran aS é€ component in a complex system designed by humrars 
and to note the fundéerental advantages and limitations of 
the human in relation tc an automatec voice recognition 


system. 


B. FACTCRS AFFECTING RECOGNITION ACCURACY 
1. General 

Limitation otf vocabularies to 122 words have 
mesulteéd in idé@ntification eccuracies of between 98% - 9Y% 
in a controlled laboratory environment. In an operational 
or field setting recognition accuracies have been reported 
es bommms £02 (Ref. 22: p. E2ZE]. Various factors noted for 
mmLert@ming with successful identificaticno have included 
background noise, inconsistent TLeropnone placement, 
insufficient training, itconsistent sreaking style, and the 
lack of user cooperation. Lea in a peper titled What 


Causes Speech Recognizers to Make Mistakes? [Ret. 21] calls 
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Por tme determination of those factors that ‘influence 
recognition accuracy rather than the repeated assessment of 
transitory devices. Table Z summarizes the four “dimensions 
of difficulty” Dr Lea has prorosed. What needs to te 
accomplished is the characterization cf the relative erfects 
of changes along each of these fcur dimensions, or more 
Sip. vestated, find the factors influ€ncing the accuracy of 
machines that recognize speech. 

Because there are so many variables involved thaz 
atfect recognition accuracy, the list in Table < may be 
reorganized in a commrunication-theoretic framework. This 
framework models the speech recognition error rate as a 
Mmmetiemect seven complex sets of factors [Ref. £: pp. €9- 


93] that include: 


=—- lask ractors 

—amnuman Factors 

-- Language Factors 

—-- Channel and Environmental Factors 
a wleorithmic kacters 

-- Performance Factors 


-- Response Factors 


It is the set of Human Factors that this experiment 
euduedmaiysis is principally ceoncern@d with, for it is this 


Stage of the mroael that has a pajor impact on speaker 
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TABLE I! 


DIMENSIONS OF DIFEICULTY FOR SFEECH RECOGNITION 
(From Reference §&) 
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TASK AND 
PERFORMANCE 
RECUIREMENTS 
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HUMAN 
VARIABILITY 


LANGUAG& 
DIFFICULTIES 
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ACOUSTIC 
DIFFICULTIES 
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korm of speech to be recognized 
Accuracy réquirerents 

Required thrcughput rates 

Type of device necessary 


Sex 

Dialect 

wocel treet size 
VOcalecOrdmoadarac gers 
Prenunciation habits c 
Physical state 
Psychological state 
workload 
Cooperativeness 

Time or day/week 

Time since training 
Nurber of training sarciss/word 
Rate of talking 


tt ct 
¢ 4 


Sieve yo. active sucroca fslary 

Yord length 

Word sOund Structure 

Confusatility 

Langvage spoken 

pyuvacyrC, sefdantle, anal 
pragmatic corstraints 

Enhanceaodility 

Sees Satter 

Intonational variability 

Rhythm and tiring varisadility 


Noise level 

Type(s) of noise 

Bandwidth 

opectnal distortions 
Transducer characteristics 
Flacement cf the transducer 
Amplitude 

Wi Orati cn 

Acceleration 
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Terradigity. This set of human factors’ can be furthér 
Subdivided [Ref. £1: p. 2] in order to monitor their 
patluence on r€ccgnition error retes. A few of these are 


listed below: 


== opeaker Experience 

wena ninge Method 

maeoex Of Lhe’ Speaker 

~- Fhysical Limensions ct the Speaker 
aamocoeraphic Crigin of the Sceakxer 
~mmopeaker Dialect 

peeeuysical Stete of the Steaker 

—- Psychological State ot the Speaker 
—~- Speaker Cocperativeness 


-- Tire of Day or Weex 


PecaWsemaitterent sp@akers May demonstrate widely 
varying methods of pronouncing words or phrases, the above 


listed factors may be further setarated into two categories; 


) 


Cr 


those occurring between sveskers and these affecting each 
individual speaker. ‘First, some of the differences between 
Speexers sti, induce VWariability will be briefly eramined 
and then the variabilities apparent within Sach speaker that 
cane armeect reccenition accurecy Will te discussed. 
<. Lifferences Fetweén Sveakers 
Sooemer a2xperience: This factor cen take on a twe- 


rOol@ Weaning when looking at it as @ Source of variatility. 








Peerst® 915 the experience of using voice Gecor aston 


equipment. Baperienced voice recognition users shoula be 
expected to have a higher and more reliable recognition 
accuracy than those who are “naive” to the technology. 
Meese Szperienced users are comfortable using the ecuirment, 
less likely to be intimidated by the syster, and ere 
moe lemmee with 10S performance carabilities from previous 
usage. The other meaning of speaker experience has tc 10 
meecn joe Skill. Can auser who operates in a *ricrorphone 
BevyerOompest cnr a Haily or regular basis, such as an Air 
meaitigmecntrolier or a Pilot, be expected to heve better 


recognition ates than those whe have never spoken intc a 


ry 


Microphone? A aata processor who works regularly in én 
Saviroument actranding precise data entry by keyboard might 
have the type of experience or skill factor that would 
MmEcVideEe an edge over a prospective user possessing only 
basic typing skills. his type ome exnerience overlars 
mele ntemeewilth sg@aker cooperativeness and will be elaborated 
Bpon later. 

Methoag of Training: The ideal form of voice 
ma>eracui100 would be for a user to fick up the wicrorphone, 
Speek commands the machine can unaerstend, and for the 
meoroOpmeaLte response to take place. Naturally, this is the 
goal of speaker independent systems, but since humans all 
Oecmem—eierserently and our torm of speech recognizer is 


discrete, we are mandated to provide the machine some 





information about how we speak each word intended for our 
desired vocabulary (ie. Training). The method by which the 
machine is traired ty the user will in large part dictate 
sutsequent recognition accurecy. If the user is closely 
supervised and mace to carefully speak the obparticulaer 
vocabulary then we should be able to CPXpPEct hizher 
mecOgmiticn rates as opposead to the user who is given 
cursory instructions on the use of the eqcuiprent and allowed 
Pee gO >On independent of further supervision avrineg the 


training rode. An adjunct of training method is the number 


+~4 
th 


of training “samples” ocr pronunciation pattern. iat 
mem iCimee tO achieve accurate speech recognition noe e oe 
number of training passes per word is srall or ¢maller than 
Manufacturer specifications [Fef. Z2]. Using identical 
equipment, BeeewOurd Still be reaschabie to anticipate some 
Speakers, having had a lesser amount cf training samples ver 
work, Waving more succéss than others who have had mcre 
yeavpues, Der word. 

» EX: Male voices kave lower frequencies than 
terales anda more detailed s,ectral structure results *rom 
The lower pitch of their voices. This detailed Structure is 
Hose mercdtive of the vocal mechanism dnd of the interded 
vowels and consonants spoken. ValemmvolLCeS “tena | LO fare 
better with reccgnizers employing trequency domain anaiysis 
while female voices tend to have greater success with 


rachines using time domain analysis [Ref. 5). A recent 
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comparison was conducted [Ret. 22] which revealed no 
Statistically significant aifference between the sexes. 
Although not a [rimary objective of the thesis, it remains a 
source Of variability that merits some measure of analysis. 

Breeaner Jialect: DiabecuUsmnot -Onlys ciiecy "the 
Specific sound proauced for €ach vowel or consonant tyne, 
Pipedadlco Exhibit dif*erent dynamics of speech production. 
For example, Southerners have tneir readily identifiable 
drawl, whereas a New Yorker will tend to say Toid rather 
than ‘Third and residents ot Camtridge, Massachusetts can 
be heard to talk about Hahvahd instead of Harvard. 

Physical Vimensions: Throtghout the literatrre on 
speech recogpition one will see speaker variatility 
Seeger tweed) tO a Variety ot tactors, none of which include 
moe pDaysicel dimensions of the Steaker. An examination of 
MmmewreGeenition accuracy for a selected sarple population 
based on physical dimensions would provide an interesting 
Poole iteerotOo 6LDE|6Urraamificaticns of such a factor 45 64 
compon@ént within a perscnnel selé€cticn subsystem. In other 
WOTrds, whet ettect, if any will height and weight have on 
Pecognigicn accuracy? 

Geographic Origin: ths Carluicular factor is 
MuLvidwmensional consisting of several sub-facters which 


require careful examination: 


—- Place of birth 


-- Geographic area of upbringing 
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== itheic backgr@und 


s- neligious preterence 


The above may impose ideosyncratic or social aifferences in 
habits which can produce Variations in sound and 
Subsequently in pronunciation. These Sup-4actors 21] 
COntridbute a feasure of variety that can presumably affect 
meCOeNmtidon accuracy. 

©. Dittferences Within Speakers 

Physical State: The present physicél state or a 
user ofr VoOLce Recent Ww Otmmec em enlmm Cen BPreciyiaatle 
Mmariadieity in his or Wer voice. For exarple, a cold, scme 
Porm Qf Eathological condition, fatigue’ etc. can alter the 
speaker's veice. The individual ’s vcice quality coula te 
different based on physical conditioning. Is the wser who 
morkS §6ut regularly and stays in excellent Ciaees Cae 
condition more likely to show higher recognition rates than 
one who rarely exercises, smokes regularly and gérérally is 
not in the best cf health? 

Pewecnolorical StateweeSp.elterger (Ret. 23:s07r~ 29! 
defines transitory or state anxiety as a complex, uniaue 
emotional condition that can vary in intensity end flrctuate 
over time. State anxiety may te thought of as consisting cf 
unpleasant, consciously perceived feelings of tension and 
eeerememsion with an accompanying activation or arousal of 
mie @lligencmic nervous system. The concept of trait anxiety 


refers to the relatively stable individual differences in 
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emsretye;reneness. It may also be a retlection on the 
frequency and intensity with which state anxiety has been 
previously manifested and tre rcrobebility that such anxiety 
fo Oecur 6itlh6Uthe future [Ret. 23; p. 39]. The tact that 
physiological functioning is affectec auring periods of 
me rGtye ts casily epparent. The degree to which sreakers 
deal with a state cr trait anxiety may well te a significant 
mirada of COnNSideretion in the examwinaticn of error rates 
wee vOlce recognition systems. 

Speaker Cooperativeness: Hew enthusiastic and/or 
Meiings a speaker is toward the use of voice recognition 
equipment could induce sneaker variability and hence 
SgseqWenmrmeGeccenitior accurecy. In 6 military environrent 
mere many jcb positions are cf a non-voluntary variety, it 
mecOmeciwveDie to expect the selection of Woice recognition 
moors wmoO are told to orerate the equipr@nt regardless of 
moe@ir “eeersonal preterences. t. "Glee USemmedIStrusts the 
meemmowmoey Or treters frranual entry, aud, is still required 
moreuS@empewGice, we have developed a non-ccoperative user. & 
mem—-CO@rperatrve user is theretore, cne who is consciously 
Dryingwto undermine the successtul operation ct the machine. 
ime COOperdative user is one who is willing to help the 
machine by saying precisely what the machine wants and 
pronouncing it in a clear and consistent wanner. There is 4 
mntdtlige wrey area surrounding this factor with the presence 


auseme wno, although not consciously trying to confuse the 
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device, are not tuliy committed to “helping the machine’ to 
mecogenize the correct utterances. 

Tire of Day/¥eek: Eech person’s speech is veriable 
depending upon time of day, changing from mcrning te evening 
and even changing progressively over a period of time ([Ref. 
5]. An examination of recognition performance over extended 
MeriodsS cr time {Ref. 24: fp. 1] Showed a Statistically 
Stable performance over time (21 weeks) with no sericus 
degradaticn occrvrrins as time elapsed. Nevertheless a user 
Rao Ndeweead gar in tire between training and operational use 
may torget ary special ways he/she trained the machine. KEcw 
mimeo Om ad Gar is tolerable is a sutject for future research. 

ees Ce llarecus factors | 

some aaditioral hvuran factors that have been 
prepesed [Ret. 5] deserve a briet descripticn. They have 
Meen relezated to a S@€parate section becavse, for one reason 
er angtner, lacw of eqtuinmemt, current teckmical skills, 
Mitek Wet Weasurabl® quantitative data etc. experimental 


examination at the present time has beer precluded. These 


mactors include: 


meerorm of speech 

So opeakKer d@pendence 
=- Kate ot speech 

=se vocal tract size 


-- Speaker’s glottal spectrum 
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Porm Of spe®@@h refers to the type ot voice 
recognition syster to ove used, isolated or cortinucus. 
Continuous systems, being a quantum step above isolated in 
terms of complexity, bring about a4 greater opportunity fcr 
Speeker variability to manifest itself. such things as 
detection of werd tcundaries, slurring of speech (ie. “dija’™ 
vs dia you ), and prosodic characteristics could seriously 
Seeeect@erecOesnition accuracy because of these tyres of 
cemplications which a continuous speech recognition svstem 
PaAtroduees. 

A Speaker independent syster negates the requirement 
or training and thus variability betweer speakers becomes a 
more Cratical factor for independent systems to contend 
wivn. Independent recognizer pertcrmance will kave to te 
taillcred to accemmcdate an unlimitea number of pctentiel 
Seeateiemanrd their associatec variability. 

The fester a person speaks the more likely thet the 
Soetcummeronunciation will be altered due to slurring, 
gqaeleted syllables, etc.. If amachine 15 traired te one 
POThm) Om tCronunciation ana at one farticular rate of speech, 
Smigtremine rate in an application mode, will cause én 
Picreagemero recognition difficuity. With an isolated word 
reéccenizer tc te used in the experimentation, requiring a 
rinimum of 100 msec pause between utterances, and utterances 
pOvpcmme@aane <.90 seconds in duration, this particuler 


factor was not considered essential to the overall analysis. 





ies Gesoer, an important factor in terms or continuous 
mecogenitgeon systems. 

Poe Size of the vocel trect will produce changes in 
mie fOmmants of the speé€ch signal; the sraller the vocal 
Erect the higher the fermants. This cen heave an impact cn, 
HOT exanrple, transmission throveh Piriced. band vied tn 
channels. Vecal Come Chawacleris tics alec predice 
interspeaker veriability such aS pitch or resonant quelity 
of the voice. Speakers with more ‘resonant voices that 
preject well, will be easier for recognizers to handle [Ref. 


hme p. Tel. 
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A. OBJECTIVES AND CONSTRAINTS 
iH, Objectives 

eS roved earlier, Our overall oODjJEcTI¥e vas ato 
examine the human as a ccompconent in a complex system. In 
Barrower terms, this experimentation attempts to assess the 
meme Cl Mme Grr: ering cccupational, cperaticnal, perscnal, 
me siomerweal, and psychological characteristics of a user, 
On thee accuracy with which a currently availatle veoice 
recognition system Fall COEreet ly IpGeReret spoker 
ioc LeCTamees . DU seclenuly, = our aiscussidnewillP@eadress ine 
mecurremee, if dmy, ct existing quartitative parameters that 
cold Wemeprer us to differentiate tetween effective and noo- 
effective users of voice recognition systers. 
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meee rollowime Specific ckaracteristies are exeminz 
mo this wpnesis. Many ot the individual characteristics, or 
momen Medetors, are self-explanatory while others are 
prevwide@ “with a brief explanation and/or rationale ter 
SC1ECt Ben. 
aeeeccUrcatiOne! Characteristics 
This set of tparameters examines the possible 
effect on recognition accuracy due to differences inherent 
jaa Weer s Occurational skill or jot (military or civilian) 


paemera@mma. specific characteristics include: 
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erftecs 


moo function: Comparison of recognition rates 
between microphone experienced users (ie. pilcts, 
air traffic controllers) and non-experiencéd users. 
Franch of service: A Sacvor with possitle 
SCOnSequences pertaining to its use in personnel 
Sernection criteria. 
Job satisfaction: A subjective evaluation by the 
Mee teds LO Nisvyher job satisfaction in their current 
duty assignment and their satisfacticr within the 
Armed Services. 
Previcus computer experience: Ccmputer erperienced 
personnel (ie. Tata Processors) are expected to 
have a petter appreciation for the advantages of 
Moeceme Input and thus, be mere conscious of their 
Swerorus and positively Rotive teu LGR higher 
reccgnition éaccrvracy. 
Foreign language competency: Prequemtiy military 
and civilian members asscciated with TCD are 
required to possess the carpactility to fluently speak 
aq, ftoOreign ltanguazge. This ability is another factor 


that could affect one’s speech. 


b. Operational Characteristics 
iais set Of paramevers examines the pessible 


Ome FECOENItioOn accuracy due to fectors surrounding 


P2e Oferdtional use of Vice ce Be Coch t On Squier eC uu 


BC Clti@me Characteristics include: 
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S—meeraimingM@rethod: <Anm@lysis of recognition rates tor 
those users who are suservised during the training 
fode ccmpared To tnose who are allowed to train the 
Seuipment individually. 

-- Time cf day ard week: A determinaticn of whether 
the time frame in which a speaker trains the 
reccgnizer will have any subsequent a‘rtféct cn 
meccenition accuracy. 

Ss mesa iprent C€xperlence: COmterison On, —recoezn) ta on 
meaves DEtWEen exXperienceH users of voice recognition 
equipment and those who have never us#d tee 
equiprent betore (“naive’ users). 

—~—meese Of USE: ,De OPenravl Cina l= ==sinpiie Lily «of. the 
equipment could atrect a speaker’s pertormance. For 
exemple, a Speaxer who considers the recognizer és a 
Comelex and operationally ciftficult device will he 
mec nel ye 16, c€evote his cr her maximum e€ftcrt te 


mmeir performance. 


Cre, YeTSONal Coaractsristics 
ire fcitewine are varicus Characcerist ic: 
considered to have a pessible etftect on an indiviaual’s 
SueecnwmecatbGerns, and hence, affect the reccgeniticn accursecy 


of a voice syster. These perameters include: 


—— meno ce 


—“Mee’drital Status and fanily size: AS COrTeLate (Oe 
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have 


psycholcgical state and, althecugh equally Likely tc 
mest meivded as a psychological characteristic, it is 
mencidered Dete acs do wermiterteur ior “perscnrel 
seiecticn. kamily size refers to the number of 
ofitspritg the user has as oprcsed to the size tamrily 
iu whick one was raised. 

Religious Dene eneey thn a Cackground 

meeent (r dialect 

mmece Of birth/gseographnic crigin 

Level of education 

pocioecunemic class; Similar Ieee neture to tre 
SuerwactPristic cof maritel status but is considered 
ve Lowmerjtwin sefectienu ct personnel thar tor its 
Givect ©n individual speech ratterns. 

teomeal Or crthodontal care: Braces, corrections for 
Wmmecorer bite, Or major Oral Surgery, are considered 
Mime © Inplication On the  Sfreecn vcatterns of 


these individuels and the resultant error rate. 


feeuys iological Characteristics 
These characteristics eére also consiaered io 


erect ON Speech and aS a Fesult ere fectors of 


PyvereCcwme wet exdmining freccghniticn accuracy end speaker 


variability. TInese peramreters incluae: 


Heigznot 


Weight 
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-— ae ec 

scsameeisesical ccndition: A subjective evaluaticr py the 
Bser of his/her current physicel condition. 

-— Rate of airtlow: Measurerent Os ventilatory 
Ponction to k}roviade é& diagnosis of condition 
affecting voice. This measurerent can also be used 
as ar ircicaticn Ci. pessicl 6 airway (ecstruction. 

Somoldt Cepacity: lhe maximtr erount of volume of air 
Momce can ce exhaled ztellowintg maxirum inhalaticn. 
PepecredsurTre Provlces an €Stinate or the amount of 
emeerecvcubiailv availatie fer the producticn cz 
Paonation. 

Bee cco urcining:s Lkxemines whether formal speech or 


mepecent raining aifects recceniticn accuréecy. 


mee o,enOro2:ical Characteristics 
PmenCurrent DS y7EnclCeictal state Cir 4 User, Their 
mec perauviveress, and their perscnel attitudes toward 
mieeemammen and veice all contribute toward the overall 
merectmmon m@reccenition accuracy. The yperticular pearareters 


Movestmeete’d lnociuce: 


~~mepeeychcicgzicat enziety 

“mee aker COOperataveness 

——WeRT®ect at errers on subpsequent performance 

—- Attitudes toward voice recognition equipment 4s a 
time savirg job ad 
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—=—"eettitudes towards computers and data automaticn. 


In efrect, iters 4-6, ere relatea to speaker cooperativeness 
ae that now a user feels about cormp~uters and voice 
recognition covla impect on their willingness to reliesly 
peeepOrtmone use of voice recognition equipment. 

Commeeornstraints 

MesOuerisSmient Of TEST ODJECTIVES Bwere constrained 
Mepis towepne res@€artch facilities or the Naval Postgraduate 
menool. In the interest ofr time, experimentation was 
mem itvecmito sive weeks. 

Fecausé voice production iS én extremely complex 

Beret seme walch auditory, acoustic, end aerodynaric events 
meer reamced Dy the interaction ct physiological mechanisms, 
Mee weula ce bené€ficial it we cculd measure as many vocal 
mepeometerms as possible in crdér te achieve a complete ane 
accurate eeu 1 re CL vO sce UnOGUGlLOG sm) ts cSS0icC7e Ted 
eects atcnhe speakers, and its correlate to vcice 
Mmeccenmricn accuracy. aC ss caemecOllL Ment. time, send/cr 


Peeeertlsee ~reclude€d C€xamination of such factors as: 


=“Memsottal wevetorm 

S-—Meeears:er function of the vocal tract 
=-meremmca-pressure Level 

=——Shaxifftrm duration of susteinéed fFhonetion 
-“weaeinum tTrequency levels 


=~mmeadal rreyuency level 














oe SUBECTIS 

Forty-four subjects participated in the experiment on a4 
volunteer basis. The group was compcsed of £5 military 
mriceme, if military enlisted, and 2 civilians. The 
Military cificers representing the Army, Air Force and Navy 
coosisted of ci males and 4 fteéemales while the enlisted 
perscnrel representing the Army and Navy consisted of 11 
ees) cdma, C terales. The civilians includee a protescor fror 
Poe Nee Cceancgraphy Department and an employee of the 
Detense Manpower Date Center (IMEC) in Monterey. ie. he nik 
mee rcdmmea: Lhe military subjects ranged trem O-z< te O-4 Yor 
mecmcommme si Obed officers, CU@c< to CWS for the Warrant 
Meerrcoiem end HO tc =z=7 for the enlisted personnel. The 


Sule cwemaees rangea trom <B to 47, wits an average age of 


It wes GeSired that the Speakers selected for the test 
Pemrepmesettative ct the torulation ror which the recognizer 
igs to be used, in cur case a Command and Contre] environment 
PoC lime articuler, a militar, compana centér. Suojects 
Peemert in the e€xyperiment were representative cf this 
Environrert as Shown by the Sraage Aistribution ena types of 
Mrrivcmepeccuretional specialities, although some or these 
SpEcialmices are Bots Leaded pharcenGeue in Current ~ 408 
@eccmivmon (ie. Medical NCO}. 

Twenty-five cf the sucjects were frer srcrt Crd and 


mocliciewess variety of backgrounds such as pilots, air 





meaitric Controllers; Signal OT icers:, Signal nou = 
commissioned otticers (NCO s?, and infantry pliatocn 
sergeants. five of the subjects were data processors; 2 
rom the Fleet Numerical Oceanographic Center in Monterey 
Sages tron aamrinistrative offices of the Navel School. 
Iwelve sutjects were students at NPS and enrolled in the 
Command, Contrcl, and Connunications (C3) curricula. A wiae 
diversity ir their tackgrounds is illustrated by previous 
job categories such as aviation, comrunications, systems 
eooegTanmmiog, coOnmnrunications maiutesnance, conmand and starf, 
and ruciéer engireering. 

Twelve ct the subjects had exzterience using voice 


mescenoy.ior E€quirrent, having participated in previcus voice 


mp 


exgperimentetion (Ret. Gy]. &h summary of SUDJECT 


Beacacteristics is provided in Table Jil. 


fo ACP MENT 
ieee ce ReEcOzsnition Syster 

A Threshold Technology Inc., Model T-€00 voice 
recognition systen was used 6 represent a cormercially 
ewatletmee, State-Of-the ert recognizer; one which has been 
well documented as t0 its reliable recognition accuracy. 
Tre T-€@¢ is a Speeker dependent, isolated word, speecn 
PecOeNWroOn aevice wich automatically re@ecognizes spoken 
words and phrases. These woras and ghrases (utterances) may 


PE ads Priet as 2.1 second out will usually range from @.25 
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TABLE III 


SUBJECT CHARACTERISTICS 
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to 1.6 seconds and must be Separated ty very short pauses of 


-l second or more. The terminal allows a user to pvegin an 
utterance before it has completed processing the previous 
one, but in this experimentaticn rate ot speech was 
COmecOMmMeassy USe Of the READY indicator light located on 
Poe. tae Cartridge unit. This tLignt indicates wnen the 
terminel 1S reaay to accept the next utterance in both the 
treizicg and recognition modes [Ret. 25]. 
mae ihreshola €9@ in its Stendard ccénfiegvration is 
compiz sed otf the following tour elerents: 
-—Weeeerminc! ccnsisting of: 
mdagal02 Speecn Preprocessor 
= otci-ll microcomputer 
mer dieiteal KRS=2zoc)input/ourcue interface 
-- Me tataard CxI7Sey board Displey Terminal 
~~ Remote Voice Input Unit (Microfhone preamplifier) 
~-Getepe Cartridge Unit 
The terminal, CRIT cisplay, microphone preamplifier, ana tape 
Cartriage unit were table sounted (Figure 3) within an 
Secustieesouna reduction booth (Kigure seh. A conventicnal 
SHURE roael SW-1¥ ‘“boor microphone, suppliea as Standara 
equipment with the T-E€Z¢ was used. The microphone possesses 
wee cpCGmal ccise cemeelling designe. o)Ccheallows the T=690 to 
perfcrm accurately despite most extraneous bdackground noises 
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Meezure 4. “AcCUSLIC Sound Reducticno Chamoder 
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mimeo -eCherr eo ProOCesSSOrT a@acCeLts .the. speech stenel 
input from the micrertncne preamplifier and passes it thrcugh 
eee >PCGmmrai ahnalyzer fOr word boundary detection. The 
-Cature Extractor monitors ‘Yor 22 phoretically-relevant 
Beatluree, gnd COnverts these to aigital signals. Words are 
PeveClLeGmane™ cCCurrences of Low energy. A minimum pause cf 
Pet SCCOea TEST OCCur to f~revent confusion between words. 
mye Creat oaing neise at the ena of the word is remcved. The 
memaitnigme speech is divided into 1€ fixed tire segments, ana 
Seat uncem are reccnstructed ontc the normalized 16 segment 
fine base. 

fmeewielcr CGeagputer. dces a comparison of input signals 
agdirst storéa reference patterns. Each word is represented 
meectcmere «© oc) bits of informeticn. Semone Se ot mee iG 
Pace wmCTe LpcOnhicrs template and the alteérnetive stored 
maim ten plate is fund, and thet closest’ word is 
meee areCame che werd identity, vunaless toe sccre is so law thet 
—iocCceeemetimcapn OC fale dnd the utterance is rejected 
eutrighie. ive vocabulary nee Reuvce patterns are 
established py tne subject “training” the recognizer. Tides 
moe cCeOeme wsdea Cy «tne suoject mreking a set aurbder Of 
meee tultmons of the veriovs vocabulary utterances. 

mee a fateh iS round; the afpropriate character(s) 
ére sent via the cutput interface to the CRT to indicate tc 
oemucemewnlchn utterance was reccgnized. These ternal 


Matches are further categorizec as misrecceniticns, where 
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the termizel’s ‘closest’ match to the reference vocedvlery 
was now MieecH.se ly the sare utterance S20 fen no G 
reccgai@rens, in which the utterance spoken is exactly 
RecOCgmi@m@e-aeand so refiected in the Chi output. Rejection of 
én utterance 15 a third category and is indicatea by ar 
étaiole “veez’. 

ew remote voice input unit allows ccmrponents to te 
remetelyelgceted Wp to cee feet from the terminel processor 
ena provid2s the means to aajust the volume (amrplifricaticn) 
cf the amplirier te éeccommodate the normal speaking voice cf 
e2c0 Ud cular suoJect. 

Pre tape Cartridge uUrit 1s @ digital tape recorder 
useaqd to stcre ana recall épplication data and én individual 
sucject’s vocatulary reference fatterns. Once the data 
Sac Ul ideeee Seem ecOrced 1 contains all the information 
HeCeSscmm Gor initialize toe Ihreshela SG? terfinai for each 
eo j2&C te ieee O00 Se Cec pre scr mmcuCIIneg 5a 2°56 word 
MeGeCbuldeyewoleCao fay be recordeo Or icaded in a few minutes 
Poles Wie Cape unit. 

2. eee COmETEr 

fo recOmmenue “spirometer, Figure €, a Tare og 
gasometer, was used fOr measuring and recording vital 
Cacecity. It consists of a metlal tank containing a moveable 
Mees LON M@ewituno a water seal, dir input Line, exbaust valve fcr 
Pesettmme, ink stylus, and revolviag cylinder for fmrounting 


emacG @amer calitrated in cubic centimeters. 
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AS the subject odreathes into the mouthpiece, Figure 
ieee mepedcCes water in the ianer Liston, whieh rises by an 
emoOunt proportional to the exhaled air. The subject, once 
Peovean mite the mouthpiece, is given instrucvions to inhale 
lO the greetest extent possible and then exhale all the air. 
Tmis precedure was repeated tkree times and the average 
vital cepacity usea for analysis purposes. 

3. Fees flow “eter 

The Wright Peak tlow Meter was used tG measure the 
Pore Tuli tbhOWwe = Tate in a single forced @xpiration. The 
POStrumest, bigure €&, ccnsists cf a4 pivoted vane, the 
movetLlOnect wWnich 1s Opposed by resistance cf a spring. The 
mecaetvicumcuumpiece fFilS into the redial inlet whick leaas te 
mae vane. Pemuececnedsm LO GGE Vad 15 @ STindie and pointer. 
#he fcrced E€xpiretion causes the vane and pcister to rcectate 
meet Smee | maxifur attainable flicw has heer reached. Cnce 
merc ncGdmmmtr ge pCcieter 15 neld in position by @ rétcket until 
released ty a rese€t tutton on the tack of the device. The 
Beele lepegradteted in Liters per minute in 5 tLiters/minute 
Gav isiCme, OVET a renge or €8 to ig’? Lliters/rinute. 

peee@urdaliy, UMewSUbJEcCl StaedSvand acids the meter 
me a Vertical fplane as aéepictéd is figure $. He/she then 
tekes as deep a breath as possible, places the mouthpiece in 
me mouth, grifs it tightly with the teeto, and seais it 
Peon his/her lips. The supject blows out as hard as 


Messible€ in 4&4 short, sharp expulsion ct air. This procedure 
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wes cvertommea three times with the average norea 
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Seer cpriawe peax expiratory flow. 
=. ‘Tape Recorder 

An ASAI 4¢¢0 DS Mk-II magnetic tape recorder was 
used for the reccraing, Storage, ana reproduction cof speecn 
sounas (Figure 19). The device is a typical anaiog magnetic 
mace recorder moiehien tae Gf three bDaSic ) parts. These 
meowude tae Glectronics of the system, the ead asseroly, 
moa tle tepe tréensport. These cempenents teke a phencmeincn, 
Mme aS wee STEGch soOuna, that Cchanxes in tire ana recirdas 


me aS a cenvinuous event. 
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Se cewere ,eccrdéa tor ail @# subjects during their 
PamticCipetion in the experiment. Subject to availavility, of 
analytical software at NPS, further acoustical analysis 
coula De conducted on speaker variability thet might 


SuuaStdm@tmave and SULLOIrt statistical conclusicns. 


i. INSTRUMENTATION 

1 Dee Cmesticnnmaires were usea co euliee et Pac 
evaluaticns, judgement, CemMearisonus, atu Dudes. and 
mere rOmma nistcry of the svEjects participating in the 
experimentaticn. The *tirst twe questicnnaires were desigred 
[Rer. 2E€j] to preoviae the necessary information to delineate 
EyYOJEctS PEGG) Vericus groups representing those huran 
Tactcrs discussed Earlier. The third questicnnaire was used 
to meéeesure state eéna trait anxiety levels during various 
maeogscem Of the 2xcrerinent. ae questionnaires were 
“auther-administered in order to previde clarification, if 
Meeeced, 6G any written instructions end insure that all 
mespondents ccrdleted the questicnnaires ccrrectly, giving 
peer CPi te coOnrs.deération to Gach iter. 

teem pes Of CuesStionnaire items were used; open- 
meget iple choice, and rating scale. Tne open-ended 
items permitted the subject to express his/her answer tc the 
Guestion in one’s own words. In all cases, these questions 
required short (cne cr two words) otjective replies. The 


moO Choice qué€stions allowed each respondent to choose 
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meeeadpPimeeriate answer trom a list of several opticns. 
These multiple choice questions include “dichotomous iters, 
for example, those requiring only a YES or NO response. 
Finelly, rating scale iters were used to obtain judgerents 
Bre attivuaces about some object, coucert, or systen. fee 
Scestions permitted the assignment of various response 
Seeuecrnhateeves aliong an untroken cortinutr or in ordered 
Gatezgories alorg the continuunr. Betirm dad Serapnic scale, 
allcwirg the resjoudent to place his/her jvagement any place 
alorg tne line, and a numerical scale, corfining the 
SuDject’ ¢ response to a discrete céetegory along the 
men inuwe were erplcyed. 
1. User Questionnaire #1 

User Cuestionnaire #1 (Appendix A) employs a 
cOnOination of question items including open-ended, mvilticle 
megerce wand erarhnivcal rating scale iters. Cuestious -l=2z 
are adesigned tO odtain information pertaining to 


PeclLpatponrel, ~ersonal awd physiological characteristics. 


Questions ce-42 obtain atti tud ina comperison, and 
evaluation Poy Oorration pertalnibe to OCCLNd tlLOmak, 
coeréeticnal, physiological and psychological 


mearactwermistics. 
2. User Questicnnaire #< 
User Cvestiornaire #2 (Appendix B) utilizes a 
comsinaticn of questicn items including multiple choice and 


éréphical réting scale items. Cuestions 1-3 obtained 





information relative LO bby Ss PO RCo ae faGreGs waite 
questicns 41:5 were REpetiticous items Poh user 
Meestionneire #1 designed to obtain attitudinel informatior 
from tne subjects after using speech recognition euyuipment 
mor fOureWEEeKS. 
©. Meth] Guestionnaire 

Tne State-Trait nok Le oy, Inventory (Shwe ) is 
meen ised Of s€parate self-report scales tor measuring two 
Bastinc® anxiety ccrcepts: State anxiety (A-State) and 
meaty anxiety (A-Trait). This inventory was developed by 
mppelbemeer et. al. at Vanderbilt University sand later 
@Bomutinue@ at Slorida State University. It was rerroducea 
Meet tChesSpecial permissicn ct the Fubdlisher, Ccnsultine 
mewerecnOWerasts Fress, Inc., Falo Alto, Califcrnia. 

ane STAI A-Trait scale consists of e260 statements 
(Appendix C¢) tkat asx people how they generally teel. The 
Mctate scale alsc consists ot £8 statewents (Appendix TC) 
mee Cem imSliuctlens require subjects to indicate how. they 
meet dtvead Darticular moment in time. The STAI wes designed 
mom OC Meecli-acfinistered and was givéu indiviavally to @acnh 
Mimec mmcomoOlete instructicns ere frirtéd cn Each test 
monn fOmmecoth the A-Trait and A-State scales. There were no 
ome limits itposec tor completion ct the fecrm. Althcugh 
noewcnmmne items have face validity as measures of anxiety, 
Poew  ineeenteory was referred to as a 5@l DS iyvereie ticn 


GUuEStionnaire. Hacimeslioieet Tresponas LO every Sis, Lten by 
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Seclingmone aprrcpriate number to the right of each item 
statement on the form. Scoring keys are depicted with Each 


scale in Appendices C end DP [Rer. 27]. 


m EXPERIMENTAL DESIGN 

A three-factor mixed design with repeated measures on 
cne factor was employed in ees experiment. Tn 
consideration of tne wide variety ot human ftactors to be 
examined, tne experimert wes designed to allow an analysis 
cf three criticel factors (0Occupetional experience with 
Microphones, cperdtional trainiug wetnod and experience) 
eer eCl lige Pecconlt Lon accurecy while Simultaneously 
Peepeoecr ite SUifTICieCnt gata to dccorriisk subsequent analysis 
en individual cheracteristics of speexer variability. The 
Two betweeu VelmiopnesS wer]  Mberormone “Experience and 
training retnocd, The third factor, experience (Week#), was 
tHE within group variaolé. aA Summary Of the experimental 


mea1ren epepears in figure 11. 
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more tae :-¢€40, the training procedure consists of 
emperi ne vee pdsses meen Cach utterance into the voice 
recognizer. <A word list cr 1@@ utterances (Appendix EF) was 


Lrovided the subject, each utterance frompted on the CkT, 
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mies l’ passes Spoken, and then the next utterance on the 
!ist would be promptea. Hasea on the experimental aesign, 
subjects were divided into two groups; supervised and non- 
supervlsed. Those supervised during training received 


aetailed instructions, and close scrutiny on €ach ct the 12 


tes 


sees Wey the experiment administrator. If the subject 
failed to ciearly [Ercnounce the utterance, if volume level 
kaS insufficient, cf if the required .1 second pause was 
omittea, the word was immediately retrained. Non-supervised 
EMD jects Pecei ved the same is CRMC Tv Tonisn, a short 
menronstwetior of the trdising procéeaure ard, when ready, 
were allowed to train the equiptment individually with no 
Slee rvisSi0n by the experiment administrator. 

Training was accomplisned only during the tirst week 
of the €xperinert. Subjects training in the morning (@72@- 
122y¢ hours) woula subsequently test auring those periods and 
Mmepewloces Or those subjects training in the afternoon 
(14¢4¢-19¢¥4 hours). Inreaiately atter training, all subjects 
maaqe at least twe passes cf the entire 100 word vocabulary 
(Similar to a test session) to identity any problems in 
Training ct a particular utterance. If the utterance was 
morc ecym Paentizriea on both passes it was consider@€a as 
trained. kewever, ist an error (either misrecognition or 
nmon-recogniticr) occurrec, a third pass was made. If less 
meee 6 6twee6UClCUTHE «€6three passes of any utterance was correct, 


Taat utterance was retrained. 


te 





After the equipment was trained, each subject was 
Measured for vitel capacity and peak flow rate. Finally, 
User Questionnaire #1 was administered. Total time tor the 
training session averaged 1.& hours poreswuye Cts 

2. Mmeecognitior Testing 

Following treining, subjects were tested on the 
system. Kach subject maae c€ fasses tbrough the entire 
moce Ciliary 11st on each ot three days during the week. 
LTuration ct the experirent was three weexs. During. Week #1 
the vecabulary list réemainea in the sare order as auring 
training (Appendix E) while in week #2 the order of the 
utterences were reversed (Appenaix I!) and in Week #3 the 
order was randomizea (Appenaix G). The purpose of this 
Meme cewrmeyvccetulary crder was tc reduce the effect of 
learning aue to repetitiveness, anda therety provide a more 
realistic picture ot speaker variability. Tata was 
So lected i1f tne torn of recognitions, Fisrecognitions, and 
non-recognitiocns using Appendix da. 

The STAI questionnaire for A-State scale measurement 
mes admanistered just pricr to the first testing session 
(week #1, Trials 1-2) to aetermine eéenxiety levels prior to 
using voice equa pment. Dien g week #2 another STAI 
quectionnaire for A-State scale was administerea following 
Mcmi ist LESt sessiou of that week. The final STAI form 


tor the measurerent of A-Trait scales, was administered 


79 
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during week #é. User Questionnaire #¢ was provided to eé 
subject at the conclusion ot the experiment. 
3. Vocabulary 

It was desired that a test vocabulary ¢imilar to 4 
vocabulary intended for practical application in a miiitery 
environment be used. Ot CONCeEInD “ine the “desi2n ci Syae 
vOCaculary wes the fact that trief ronosyllabic words ere 
more Gitecult to recognize that longer rpolysyllabic words 
cr phrases. A relatively equal cistribution cf words and 
utterances contéining a syllabic content ranging from 1 to 
79 syllables was cseélected as the final vocatulary. The 
words weré chosen tctn from previous experimentation [Rer 
25] and tne authors military experience. Appendix I 
Provides a listing of the i899 utterances used in thé 


Smeoerifmeme aud ccrsidered as representative of use ins 


~~ 


feeritaryecONmand center. 
Gee «(C(VARSRREI ES 

them@evendent Variables in this experiment were tote. 
Emre rSyemee Lin@ar cOmbination of misrecognitions and aecr- 
RBeccenitaons . Inaepenaent variables in the overal! 
experimentel G@ecsten sabe @xXTerlence, job tunction, ana 
training method. Additional indepenaent veriables included 
sew Ofmepae individual hunean tactor characteristics elicited 


earlier 





woes COLIECtTed On the eleven subjects ‘within Each 
meu )6CUOe «CULE 6C ec Xperingntal d@sigu. Each subject made €¢¢ 
utterances per week for a grandad total of 1&8@@ for the 
experimeéat. Moa Ubterances tOr (He Tconplelea Experinerr 


nurtered 79,202 (44 x 1802). 
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Wee SANADYSIS @ND RESULTS 
ee ee ae ee 


A. GENERAL 

All analyses were performed nese the MINT ESE 
Statistical package [Ref. c&]. Repeated reasures analyses 
Se Mdlterce prccedures were ypertormed in accordance with 
guidance provided by Pruning end Kintz [{Ref. 2s]. Non- 
merdwiectmec LEeSts ter Signiticance between fyairs or reans, 
Semecral@e2ndependent scemples, and fcr trend analysis were 
Menaucted Utilizing precedurés aiscussed by Conover {Ref. 
oo}. Adaiticral peremetric analysis followed procedures 
Beescribed by Ott |Rer. cij. 

Sill mé€an error rates thet appear in figures are of 
untrenstoOrrea data. pance tne FF test in an anelysis of 
Yariance is vdlia even with mila aepartures from. the 
Un ptmaemor Caudlaty “ot variances [Ret. S31: p. 624], 
Hartley's Test for howogeneity of forpuletion variances was 
used LC determine whether an extrere case (unequal 
Variances; existed and wtherety determine if a transformaticn 
memeddtem would pe requirea to stabilize the variances. 
Fesuits of this test are presented in Tabdle IV. The 
Mesumpemeno ci equal variances is the basis for the use of 
untransformed data in ali sutsequent analyses. 


Tne eorrelation COCT rT Heme mt reported herein is 


Speérman’s Rhc. Althcugh the Fearson Froduct Moment 
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TABLE IV 
TEST FOR EQUALITY OF VARIANCES 


DEC VSaCN : 
mevel “of Signiticance: °.85 


Tabulated value of F ai OO 
Max 


CANNOT REuECT THE NULL HY FCTHESIS 
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meecATAGS < 1 
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momreiatmion coefficient “r° is most commonly reported, it is 
however, a&@ random variable, ana eS Such has a distribution 
function. Conover [Ret. 34] states that “r° has no value as 
a teSe Sbatistic ip nonparemetric tests unless the 


elsitrivdution is known. 
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B. OCCUPATIONAL CHARACTERISTICS 


1. Hyrotneses 


The 


occupational 


following hypotheses Eertaining LO the 


characteristics Cit speakers using vcice 


recognition equipment were tested: 


Job ftuncticn (microphone experienced users 
versuS non-microphone experienced users) 
will have no an rect on recoeniy len 
accuracy. 


Job function (microphone experience) 
eifeets ©TecoznltTlon accuracy. 


The pranch of service the mrilitary member 
oe iones LO Wide) have Oona! 1 CC lero l 
Recon l von  aecurdc y.. 


Recognition accurécy is influenced by the 
Drencn Of service of the user. 


A user’sS attitude pertaining to his/her 
MEesSe uy Job “Satistacrion Bwitt nave +20 
ae eet. On recceni tien acitumacy . 


JOb Satisfaction affects recognition 
accuracy. 


Viemueeree cL Salistaction a “User derives 
trom being a member or tne rilitary will 
ECumd “sce bereecoent | HOnmdc Gur acy . 


Service Satisfaction has an affect on 


BeCOftt hb. On eceCurdacy., 


The ameunt of previous computer experience 
a “Usenoemacmeadd Will NOUsalteecterecognition 
accuracy. 


Previous computer experience at heeys 
FECosarllOu aCourdGy- 


E4 





Ocepereweymeine dad roreren tangiage (bi- or 


e multilingual) will have no affect on 
recognition accuracy. 
Hy: Competency in a fcreign language will 


affect récognition accuracy. 


ee. Meo Function 

The restlts of the experiment for users with anda 
meevsOuULmericroL~aone experience are shown grapnoically in 
figure le. Microphone experienced users tared only slightly 
Pete nwmowen NMOn-nicrophone Experienced users. The analysis 
ct variance (ANOVA) results in Table V Substantiate this 
Emowinegween £ ratio of .é7% indicating no statistically 
memitveamt dittference in the user’s job tunction. Thus, 


Miami lim POpMeSIS Camnot te rejected. 
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mesure lc. Mean Error Rate vs. Job Function 





TABLE V 


ANALYSIS CF VARIANCE FOR RECOGNITION ACCURACY 


SOURCE 55 ag MS i 


HOTAL 7329E.0L 121 = = 
BETWEEN SUEJECTS bAQEZ.EU 43 = 


Micromenone 

Experience (MIC) 6$6.61 i £36 .&1 Ben 
Training 

Method (TNG) 5629.5 1 eeco.c0 &.8E8 

MIC x ING Peg 7C9 1 L7O9.69 ioe w 
Error(b) 46Z5€ .E’ 42 1166.41 = 

WITHINeSUBJ ECTS 19213.41 RE 2 = 

Trials (TR) 4324.19 Zee ce 0S at -C9e 

OR x Mac Pence 2 en shor 

TR x TING 74.82 Z Sa ee 200 
TR ESL Cwxc ING Leeee Z 6 cue 35 
Error(w) 147&&.42 SE 184 .E5 = 


[ ** SIGNIFICANT at p < .@5 ] 
[ NS: NOT SIGNIFICANT for ¢ < 8.95 ] 
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incr Otten EEemeri€nee: kxperlienced vs. Non-experienced 
Training Methoa: Supervised vs. Non-supervised 
Trials: week #1 (Words 1-100) 


week #2 (Words 120-1) 
week #Z (Words in random order) 


alc 


NS 


fee 


NS 








Mean total error rates for microphone ana non- 
microphone experienced users is summarized in Table VI. The 
aefinitive decreasé in Error rates vy time will be discussed 


latér in the review of operational characteristics. 


eee Valve 


MEAN TCTAL ERKOR HATGS FCR JCB FUNCTICN EY WEEKS 
(in Percent) 
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Oouwmemanch Of Service 

Three orenches of service were represented in the 
eee ecurr With Civilian sWbjJEcts categorized as a fourth 
Oranch. A Kruskal-Wallis test for k > Z samples was used to 
determine if any aifferences existed. Table VII proviaes 
the synepsis ot results. The gull hypothesis, that odranch 
momeecermmer wili rot affect recognition accuracy, is clearly 
Pe ject ede Multiple ccmperiscns were made to determine 
between which pairs of means the differences occurrea. The 


results of this test indicated significant ditferences 


betweer Army/Navy and Army/Air-Force. Differences between 


&'7 





Civilian/army, Civilian/Air-fForce, Civiliazr/Navy and 
Navy/Air-Force were not significant. 

Further inspection of these resuits indicatea 
possible confcunding due tC experience wito voice 
mecOenitien equiprwent. All ‘Air Force persommel ana ¢ cut of 
€ Nevy perscnnel were experienced users. Segregating the 
experiencéd ana naive users into seérarate categories and 
iene Temomdmwecting ithe analysis fer affect by brach of 
service showed nce Statistical signiticance (Tatle VII). 
Using tne original hypotheses estatlished, the arll cannot 
pe rejeq@eed in eitner the naive only or ezterienced only 
GCe=es. Weeaen error rates by brench of service for ell, naive 
Sealy ata Gmper  ,euced oe ewolvects, are presented 


papi Camy in Figure 13. 


TABLE VII 


AFLECT BY BRANCH OF SERVICE 
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RATE 4.0 - 
; BXPERIENCED 
3.2 - ae z (3.78) 
ee ee ee ee 
2.4 - (2.85) (2565) 
a 
a |S Sasa aan aoeaee 
Civilian ATHY Navy Air Force 


Pigmre le. Mean error Rate vs. Branch of Service 


4, Meo and Service Satistaction 
Beojects were divided intc tfcur ereups based uper 
mietr Siemective responses and igcluded: 
a. Persons who disliked tneir jobs 
Gmpeeacse WhO were borderline Or neutrel in their 
teelings 
ere ine@ividuals wno lixecd their present job 
femeerersons who inditceteéd 3a very definite Liking of 
Pitta Lited tarir JOpeVery fuch 
Tae attained test statistic (Tasle VIII) leads tc the 
GeletstOgmevoay the null hypothesis cannot be rejected. The 
Correlation ccettficient between the two variables was not 
Sigerificgant and it is concludé€d that there iS no apparent 


correlaticn petiween the satisfaction aiuser has for his/her 





TABLE VIII 


ALEECT BY JOE/SERVICE SATISFACTICN 


$e- eee toe eH +H —-—-—--——- - — - -— + 
| MOR sn ACTION jMSERVICERSATISFACTION. | 
| - --a - teem - HH- Hee toe —- = - 
itType ot Test ; Kruskalrwallis | Kruskal-wallis 
poe eee + —-———- tao 
| Alpha} OE ) 1 ! 
procooo-------- ¢------—-—-—-—--+--+~—-- fone eee ee 
Tece | | ! 
Pe otatimetic | 4 .EQ ee 
poo oce neon tee ---—- —— — — +—-—---——-~—— ~~ 
mec r i t meal : 7 
| Level | 20 2 30 | 
pocceecr en - +—-~-------—---~---~—- +--+ --- ~~ 
| Correlation | 
mitoeffiient | Ome 041 
onteetententententenientententetionton +e ee ee ee ee +-—---—-— | 


ior entayoco ana Wow well that user will pertorm with vcice 
mecOeniteson Eequirmrert. Ditomeert Cy van seuningn fector Is 
mevertbemess wOrthy ot further Examination in the future in 
memes MOreetserS wohose current ‘et entails tre day tc day use 
Of vOice Equipment. 

meeoemanelysis Of tne atirect service Satisfaction 
eee CME eC GT tTicnh accurecy, the ¢° civilians were removed 
PrOm™ tne semple poprvlation. Subjects were now divided into 
Perec Beeeups tased upon tneir subjective respenses and 
rc lud eae 

a. These whe ere unsatistiea or don’t care 

bd. Those who are reasonably satisfied 

e Those who are very satistied WLt d ieee 


respective service 


GO 





Imemveste statistic (Table VIII) reveals no significent 
emer mee between S£roups and tnerefcre the null hyrothesis, 
that the degree of satisfaction a speaker derives from being 
ja ike farmed services will not affect recognition accuracy, 
carnrct be Renee hed . Correlation between service 
satisraction and total] error rates, as before, was not 
Seyeeiticant, thus indicating little or ne correlaticn 
feoneen Gae random variables. 
o. Frevious Gcmrputer Experience 

meee ctS were subjectively divided into four groups 
pased upcn Their mespense §to Guesticn roe in Wiser 
Se sStponmaire #1 and included yperscors with: 

ae re ex perlerce 

db. Very Little experience 

Meese e Cr TOcCErate Experience 

d. Ccnsicerable experience (data processors) 
MemeenamesssS trcviced @ test Statistic (Table IX) which 
meee teCume a thee re j€ctLicn “ot the mull aypothesis and the 
Pome iucmerca TNat Previous cOmputer experience will affect 
mecocNigmon accuracy. VULC to meaonanmisOnSse were. performed 
pomoeteC@wine which oairs of means differea. Seem Peat 
(emer eremees OCCUrréed betVwEEN users with, mo and consicerabdle 
Peer cme, very littie and mcae€rate experience, and very 
ey vle and considerable experience. These results 


demcnstrate that pcessession ot experience with data/xeybcard 


Pao uememeoceaures provide a higher ré€cognition accuracy. 


el 





Beplanation tor tnis cccurrence may be attributed to, ‘Yor 
efample, a data pfrocessor’s awareness of the tire involved 
tor manual entry and the associated error rate as well. The 
eavantages intctwevOlce w= tnpuL Offers sito those computer 
peeeerliemeca personnel may well be a psychological on § 
Metivativenal factcr in addition to its presence as an 
Meeupativemal characteristic. 

Meese results are tturther substantiated ty the 
PePouLedmecOrrelation coefficient. Ferrorring a Oone=—teil) 
moet OGmueaative correlation with the @Zistence of mutual 
independence as tne null hypothesis, we were able tec reject 
Pars fyegemnesis and coOnclude€é threat eS computer experience 
lucreases, récognition error retes will decrease (Critical 
bevel: << .9@1;. Graphical representation of meé€an error 


meee S ©Oee tne tCOur Zroups are shown in Figure 14. 
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Figure 14. fMedén srror Rate vs. Computer Experience 


6. koreign Lénguage Competency 

MecOsnition accuracy was ccmpared between two 
eee osSgummraose With a fluent proticlency in a foreign 
languege and those witzort. Cem Su O;eECtSan POSSESSed “no 
Capability in a se@ccona lang.age, whereas 11 were competent 
Bimone wOmerore laneveses. lhe méediér total error rate for 
moe eme@ures was 6.224. A two-samtele non-farametric test, 
the Mannewhitney, wes verfcrméed tc detect the existence of 
ey Gite@crences between the two grcurs. The computed test 
Statistic (Taole Xk, clearly shows no signiticance at the .¥5 
maovel dame taererore, the Muli hyrpotnesis cannct be rejected. 
moe wchemercal regices for this twc-tail test included values 


eel oeme@es tt SLALGISt1IC less toan E72 Or greater taan &i4.e. 





TABLE X 


AFFECT OC} COMPETENCY IN ANCTHER LANGUAGE 


$—- 2 -- -- -- - -- - ---- +o_------— - - - ee . 
| FOREIGN LANGUAGE ! 
aia a TOO ee 
| Type of Test Mann-Whitney 
| pee en ee pee me ee 
| Altha " ¢.2S ; 
Jor rer err ee ee ee fer —------ - -- - - - - - - - - - - -  - 
| TEStmOvatiStic woes 
a> aa lO ian Aaa aaa SS Se 
GCritmeal Level one : 
i { 
a — =. ll = Se “SS a a al lemma 
| **¥ = Significant at statea ievel ot signiticaace 
pe ee ee ee em ew we em ee eee we em ww me ww we mw wm ow oem ee ee + 


Cee OP mma IChAL CHARACT2ZR21STICS 
1. Hypotneses 

eo Se ae 

The following hypotheses apply to the operetion4. 
BMaraClLertStics Under which the sutjects were testéd. 

ie etemmeuwnod Of Trainene a user for voice 
recognition operation (supervised versus 
non-supervised, will rot affect recognition 
ACEUTe Cy. 
Metaodnot tratwime willmattfect recozariaen 


accuracy 


oe fee: The time of dey in which a user trains the 
equa pment will LOL merece beer eCOeni Tlou 
gecurdacy. 


Hy : ReEeocnhition decurecymor the user will be 


atfected by tne tine of day in which he/she 
trades (ne Vence Teccgeni Zer . 


YS 





mo H5- lhe period Of thé week in which the user 
trains the equiprent will not attfect 
PECORZHITION eccurec;. 


© @} 


Paeweperlod “Ci | the week “ih "which the 
equipment is trained will aa cect 
RECCShi lon accuracy. 


d. d4,: kExperieuced users will acquire the same or 
Srecuer Error reves tren inexperienced 
(naive) users. 


ir 


Experienced users wilt have lewer error 
rates than adaive users. 


m3 
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MECOCMIEULOR deCuracy will a0t ©§e aitfected 
by weekly experience. 


A user will demonstrate reduced error rates 


(decreasing trend) as experienced will 
VOICE FECOZNITION EGulpient increases. 


Es ip: ACO natholcdl ee coe nto which voice 
beGeoihilon EQGUIDTeO Db aye te sused “will have 
Nowet tect ObmereCOZMi tion accuracy. 
H, Zase fees ©. ates ea Reece L Pe CO hia Ou 
Aecvracy. 


Aomeetnca of ireining 

Bilemrecul,S 0: the gexngeriment ior users receiving 
Elther supervised or non-superyvised training are depicted 
Peo nveamery in figure 15. Users who received supervision in 
the treining woae fared significantly better than those who 
did not. The analysis ot variance tatle (ANOVA) in Table VY 
Sue Leveero te this Claim, providing an F ratio of 4.&€& and a 
eolvicammee1evel ct approximately .¢é5. Pius... Stine: wie 


Peetieamm=> 815 rejected and we may conclude that the method 


Poeulaimumeme does aifect recognition accuracy. Mean total 
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Summerizéed in Table I. 
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Oo. @ere ot Day and Week 
subjects were blocked by time ot day; morning and 
arternceor, and by time of week; early (Monday-Tuesday), mid 
(wednesday-Thursday) cr late (Friday-Saturday). A Mann- 
Whitney test was performed to determine if differences 
existed tétween the two time of day groups. Morning users 
nad é& weadian error rate of £&.1% while afternoon users had a 


.E7% error rate. Fecauseée oft equal sample sizes, a 


M 


Parametric iu-test was performed <to confirm results of the 
Ome edredieetrs’c test. The presented in Table AII will not 


Seeeewo Uenw t) reject the null hypothesis. Critical regions 


++ 


‘or tne Mani-Whitney test included values of the test 


in 


Sacis vmeevess then 411.3 and greater than 57@.68. 

With toree groups in the time ot week variable, the 
analysis utilized the Kruskal-Wallis test for determination 
of differenc=s among the groups. The nell hypothesis cannot 
Pomereened With a test statistic less than 5.99, for the 
Chi-sanareé value with two degrees of freeaom. The 
merereianmon cecersficient was sound to be significant at the 
0.02 level in é test for negative correlation. A prematvrre 
momelicmem that training occurring in the latter portion of 
the week woula yield lower error rates appeared to 0be 
cour tverarotuitive. Tt was thought that fatigue, and 
interruption of a weekerc would result in poorer training 


Pa OrusSmeand nence l@€ad to high@r errcr rates in the future. 


Upen further analysis, this reversed ccrreletion was found 
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momebe Meee result of possible confcunding arising from the 
baree numver of experienced wsers who trained in the tater 
period of the week. Hight out of thirteen late week users 
weme @€Xperlenced and with their reroval from consideration, 
the correlaticn bétween time of week anda total error rate 


tecame statistically non-significant. 
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ArSeECT OF TIME OF DAY sAiND ERK 
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2. User Lxperi€nce 


MuooeecetsS == or hypotheses in Section ¥.C.1.d are 
PegereOmemed Intec this paase of the analysis. The enalysis 
PC newmmmrst Sel was pertormed using the Mann-Whitney test 
end the essociated results are summarized in Taple AITI. 
Weewedmen e€rrcr rates tor naive users was 7.2&4 while 


empericmeed Users attainéd a2 2.754 error rate. Both groups 








Mmeecaucmmolmbers Of supervised and unsupervised users. the 
correla teen Soc. -1Cl1 ener Yeelvded Py eOne Mewes tne “Strenzest 
correlations between two variables within the experimert. 
ise null hypothesis can be rejected and it is therevore 


Per ldeds thet Experience will affect recognition accuracy. 


TABLE XIII 


ANPECT TUE TC USER EXFERIENCE 


ee ee a ee + 
EXPERIENCE 
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| Type of Test japo-Whitney 
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Tae analysis of the second hypothesis of V.C.1.d is 
marci cdmerrarhically in Figure 16, (Triels by Joo Functica) 
and Figure 17 (Triais by Training Methced). In each case no 
Pee raemmed 15 Eresent, with the weekly error rete Showing a 
pecdyma@meop Ct aprroximately .@ to 1.4% each week. This 
meapiiicamee inverpretation iS proven statistically in the 
mOWR presented in Table VY. That is, the F ratio is well 
above the 3.11 required for é€ level of significance of @.@5. 


PopeenUieay Otnesis is rejected and it is concluded that 
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eipure 26.  —Triels versus Job Function 


(E.61) Week #1 


1 
E.G ~| 
(7.ci) Week #2 
2.0" 
! 
I 
eee |) (6.21 } (E.0@0) Weex #3 
MEAN % erga, (Dice | 
ERRCE | 
RATE 4.0 -| (4.17) 
J 
{ 
o.2 -| 
i 
J 
2 a 
i 
7.0) 
I 
I 
aa I a ee oe 
supervised fon-Supervised 
Ura is We Eralning 


Mrerre 1’. Irliels versus lreaining Method 





Meets wie improve (reduce) their errer rates threugh weekly 
iteretion. Tas Comenius i107 wes further verified oy 


Mepeicatmenr ci the Cox GG otvwart Test tor frend. eae 


fy 


following cOmparisons were made retween: 
a. week #1 and Week #2 


week &d 


nN) 
Oy 
tS 
fu 


Db. Week # 
c. Week #1 and week #2 
mrecll Waeee Cases, the null hypcthesis, that there is no 


qdOwnuwerd trend, was clearly rejected. 


SD, Mase o: Use 
maoed €N Sub JECtive resconses bY those participating 
Devoe erinfent fOur igercecurs were categorized. They 
mie lude: 
Ecce isewNO CONS1Cer volce recognition Equitrert 
ier eGill lt oe hls e. 
O, inese who had no Obinion elther way. 
mepUscers We Slated that veice Equipment is €asy te 
use. 
oe 2nese who feel thet voice recognition equirrent 
omc e Ceyeccc) hou Wct. 
megensresi@eers Of t4515 analysis are surmarizea in Taodle ,#1V. 
mae Lest Statistic 1s less than tne Chi-square value of 
Pee co Werh three degrees cf freedcon and therefore tne avll 
Pamioectm@eerejected. the computed correlation ccetficient is 


meu Sle@meticadant at the 0.05 level. 
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TABLE AIV 


AfE2CT DUz TO BASE OF USE CF VOICE EQUIPMENT 


tome mw wn nw nme wre rere tn nr nmr nr rr nr rm renter rrr ese ee + 
I ASE Or tS a | 
es es es ee ee I 
Sere NS Car ea Mc ee aCe gc eh 
Type of Test Kruskal-Wellis 
ae — ae ee | 
Alrvhba | g.25 
= — a $m mn me ne wn = nn nw nn nn 
TES WESiLa trstic ee 4.814 
ee ee a ea Te ee ee 
: Critescal Level Preece 
jotta ere oo POSSESS SSS SS SSS SS SS SS 5 SS SS 
Correlaticn ! 
Coemricient eae : 
aan "177" ~~~ To a oat gS ee ee oe ee ee | 
| 

1 


Sore nwitlcdat dtl Steted level cf significance 


BaeeiOlloOwene nyrotheses were tested pertaining to 
mies oersemel characteristics cr vcice recognition users: 
Ge eS nace ssOt ~ Tae. user will not ariecer 
ReGcCenrTLoOm -acourdcy . 


Men tT Geren ce Va reCOlUt er On accuracy eExiets 
HomwecotmuUsehSomOLmdlo rene mt face. 


{53 


mess IN Mapitel status Of the user will not 
dime chmrecosuTtlon aecurecy:. 


* & wser © marital status will have an attfect 


at Sa 
on his/her recognition accuracy. 
PecomomG: pcwicen Sm canilyeuwiir net at ect 
FECOENIt1ON accuracy. 
Elan. Hatilyee sol Zeamivitiee Dave: san atrect on 


reccgnition accuracy. 
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The religious preéeference/background of é@ 
Eser will have ne affect on his/her 
Bec cent ylonmaccunadoy . 


A user's religious preference/background 
Meni duismeeu TecOehi lLon accuracy . 


A person’s eccent will not affect his/her 
LsCCenreclon accuracy . 


mecent ali@eys ©ecoznitlicneaccurdey. 


The place ot birth cf auvuser will have no 
ceeuce ween TeCCED1 Elen accuracy . 


One’s place of birth affects recognition 
dicen cicy 


The gécgraphic crigin of a person will not 
Pec Gwio OG enet TeCORNI Lion accuracy. 


heeeecen S Trecernition  sacciracy will te 
eaeifected bey gecgrephic origin. 


Tre level cY e@ducaticn an individual has 
atteined will not atfect nis/her 
NECCeNatiOn aCCLiracy. 


Ou GavwlOms Level rG: a ioe r ae eC us 
Mectearrnon eCcCurecy. 


Tne Socio-econoric class or aiuser will not 
Meme cu TeCOERi LON accuracy. 


BR user s recegnition accuracy wili te 
a CCUCUmoy= SOCIC-CCOnOMmlCceciass Standing. 


Pact "Oba seTreer, sor orthodontal care will 
MOtesat#hee, TeCcoeniticn accuracy of the 
seh. 


fPeCceemetlOn accuracy Ot tne user will | te 
attected if he ocr she hat wndergone oral 
Surgery or orthodontal care. 





2. hace 

Iwe racial backgrcevnds were represented in the 
Sampled mopuletion. Thirty-elept Caucasian and sixvegro 
Sep jJECtS = barticipated in the experimentaticn. The median 
pebel Ermer rate for Caucasiam personnel was E€% end €.8&% for 
MeerO USerTS. & Manno-Whitrey test was performed tc detect 
Be  preserce of any difference between the two groups. The 
Pomewleteme LEST Statistic (Table AV) was net significant at 
ome -9>" lewel end Ghe muld bypothesis cannot be rejected. 
MeeiGedl meesiONS ror the test statistic ir this two-tail 


test were values less than 797 éana greater then Yilc. 


TABLE XV 
AEPRCT OF RACE GN RECCGNITION ACCURACY 
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OS. @eetital Stetus ana Family Size 


EEE OE ee De EE Sn 

The sample popudation consisteac of 14 single, é£95 
werriea, <¢ aivorcéd, ena Zz ctker (se€parated, widowed) 
bercompem. A Krmuskeal-wellis test tor k > 2 samples was tsea 


POMdeteCmmrne it any differences in means existea between the 
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Pou o. meee cause the cOmputed test statistic (fable XVI} tis 
mess Ulam ¢.€15, the tabdvlated chi-square value with 3 


mererccS Om freedom, the null bhypethbesis cannot be rejected. 


%, 
a! 


iomeecOrmeration coefficient was computed for marital status 


G@té to the nominal scale of réasurenrent. 
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ever STATUS AND BAMITY SIZE 
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(rmoaipie pcpulatic“. Sutd@ivided | intc “2ive groups 


for family size with a range fror no childre 
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rome ine Or fore chilarén. Ae hrUcrabeewailic TESt = was 
Erasmeoe teed ) tc aétermine if a aifference existed and as 
foeorcemmerae pull hypothesis cannct be rejected. ae 
COMputed Gerreletion CO ig Glen tndicates ruben 
magepedamemee bFEtween tamilly sizé ard tcta! error rate cf 4 


merce Peecenition user. 
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4. #@eligicus Freference 

meeooveh a diverse variety of religious rreferences 
mere enumerated by participating SURIECTS, SCme Mere pecled 
Perec NUreroOus Samples sizes of jvst one person. Por 
example, Methedi¢t and Episccpalian were cemtined intn the 
PeoemmeStamee caterory and so forth. In ell, six groups were 
represented drdeeeciudcede Calheinc, “Pragestant, Jewish: 
Baptist, No Preference and Others (these who covld not. be 
readily grcuped into one cf the atcrerentioned catezcries). 
Mmeeee the weATruskal<sWallis test to echeck for differences 
between reans, the Obtained testi statistic (Table XVII) dces 
ieee fOr the rejection of the rnll Ie CO sic eomcer 


mete: Ores 1t fay te concluded that the refligious preference 


of the user will not affect his/her recognition accuracy. 


TABL2 XVII 


PPE ECT OF WeemiClous PRY ssRancs 


oe a See ee re nr eee ' 
| I 
Dyoeeot Test Kruckal-Wallis 
| ----- + +—-—---—-—- ~~ - - - + - + + + 
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Ten subjects possessea sore type of noticeable 
accent, as ceterrined by the subject and experiment 
Semin istmator. Seven were southern and three were 
Categcrized as GHEY “onan icoh. Bostonian). Remaining 


subjects were vtiaced in a “No Accent’ group. The recvultant 
TEST Statistic (Paple XVIII) was slightly less than the 
tabulated Chi-square value cf 5.991 with two degrees cf 
freedor. meompouce, the null hysothes®s cannot be rejected. 
Mamcdeaditmendi check was acccorplished by corbining the tne 
accent groups into one eeneric entity and performing a 
Pann=Whitrey test to detect a ditference between te two 


peeoepS. weecair the null wypetnesis cannet be rejectec at the 
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Boevea Wewe)] of Significance. Correlation enalysis wes rot 


erformred due to the neminal scale ot re€asurerent. 
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ARPECT OF ACCENT ON RECOGNITION ACCURACY 
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peeudovenmeooe Dull is not rejected, the critical level is 
ee sici@maiy close to the stated level c? signi“icance. 
Meus, nean Crrer rates are illustrated in Figvre 18 for 


Pucthner @ehMamination. 


mine | 
meee = 
s.0 - 


e.¢ - 


MEAN & Sag 
ZRERCR ! 
ae rn = e ¢ ot 


) 
GS) 
{ 
eo 
mS 
CN 
al 


Cac —— SS SS = a ee ee a ea eee cee me ee ee me oe om oe oe ! ee ee ie ee oe ee 
Ne Accent SOUtCKheETD Cther 


Figure 18. Mean Errer Réte vs. Accent 


S.  $tace cf Birth and Geographic Origin 

Pemecets Were eSked tO previcde thelr state of birth 
ead tneig. eee eee Wore ISuCSeQNeni wc lasSiiled into One oF 
Pew Cmuenwene Six veneric groups: 

a. Qverseas 


er, Northeast United States 


10& 





Gee ocuTheéest United States 


de Mid-Central United States 

e. Southwest United States 

je Western United States 
Meeeryinemene Kruskal-Wallis test to the corriled data, the 
Poe rmedmetest <stetistic (Table XIX) is insrfficient te 
mevect Gee Stated null hypothesis. 

RPecausé 4 person’s place oY dirth is not necessarily 
the environment in which thet individual grew up in (ie. 
during eses 2-18), data pertaining to geograrhic ortgin was 
eeeeoe veeped tO dé€termine if any negative affect would be 


encountered. The geograrhic areas used were the same a5 


su 


peace “weer Cirtno. Calculated results pcirt tc the fame 
mamemlenon, sthe rvuli hypothesis of S€ction V.D.1.e. cannot 


Pome |e cred. 
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Peroc td CF PLACE Of BIRTH AND GECGRAPHIC ORIGIN 
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Co Level of Naucation 
aa ae ee 


<—t 
Maer 
0 


The sarrpled rorulation Port. oon ed poe 


follcewing five categcries: 


gee high School gradvates. 

Bee individuals ‘with 1 to 4 years cf college tut nc 
degree. 

c. College graduates. 

d. Individuals werking teward a graduate degree. 

Smerersons accorded a graduate degree stch as é 


fae Gers, cr Loctorate. 
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gme cata Ottainéa trom the tive grecurps was 


ct 
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zOr any sSigrificant difference between groups. The 
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Uarcisticeiteble XX) leads to the rejection cf the fv 
Pee unesess and the conclusion that level of educaticn 
PweecctSumnc OvE€rall error rate for voice recoznitior users. 
Peeecewceuavery™ strcne cvositive coarrelatioc exists witha 
Secicdm1evel of &.00€E. baat wiley aS otme indivi dive 
miereaseam in Level of education, a concopfitant decrease in 
peer Cdae Cccurred. 

Miemumeile cOmparisons between “the variovs groves 
showed the rredorinernt influence te be graduate students. 
Mamie CT merdriration indicated possibile confourding due te 
that POUL S Prien experience with ~Yolice fecorenition 


Cavuipment. SBleven cut twelve graduate students were 
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TABLE X 
AFFECT OF LEVEL OF EDUCATION 
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PIS ewoeese hs. Lnhese Experienced users were sztripred 
mueeeeci mmeca@e. Satple and the Kruskal-Wallis test applied tc 
mee suo lermemuUnat Were naive tO Voice technolozy. UsimZeraue 
Pome hypetheses, the obtained test etatistic dces net allew 
Mee uiemnre jection of the null. Taaicg aides Une sere corMpn ted 
correlation Coen. i cient cerrotcrate te Meo hy On 
memroun@een= end the earlier conciusiacn iS mow amended tn 
Peoeen meat ievel of education will not affect reccenition 
Seecuracwmmetiean error rates for all education levels are 
eowiecmeeabcaliy in Figure 19. Error rates tor both, total 


Sarple poruletion and naive users only, are incivded. 
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en earner Se a 1 fo - 
High a College Grad Grad 
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weavuTe 29. “een Error Rete ws. Education 


G. “mcwe-eccnemic Class 
A AT Teg on ee 
A veriety of sociso-econoric clesses were presented 
mimeo nhemecenrticitarts tar selection with one of the following 


mere Chosen ty Gach subject: 


“meeceer lower class 
ie «6h LOWOCT Tidale class 
fmertdadle cless 
pemeeurrer fiddle class 


e. Lewer upper class 


miematemeci Ss Of tOtTal error rates fOr these five grotrrs 
Glaote WAkAY; yielded a test statistic that would not allow 


eee tiemeemie ction of the null hypothesis, and it may be 


ie 


ae _ 
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eouciuded thet SOC 10 ~eCconcmirec Ces sme O Gr seit 2cy 
mececnitwen accuracy, The negative correlation indicates 
ot lW@amevidusls of a lower socio-economic class tend to 
meguicemeeeher Error rates although the coefficient is not 


Sieniticamt at the @.@5 level (critical level: 9.186). 


TAFLE XXI 
Pee Cron SOCIO —RCCNCMEG CLAcs 


pee eee ee a ee tm em - 
| ! SCCIO-FCONOMIC CLASS | 
ee ee I aera NCR 8 cee cep Sa i cs Ue ' 
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, Type of Test Kruskal-Wallis 
ee ee Ue a a a en eS | 
J 1 
| Alpha @.@5 ! 
I ee ee pee ae a ee ee eee ee | 
| i 
Testmametistic oe 7 
[ooo ee $e eee ee a 
vPitieal Level 25 
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Correlation ! 
Coemricient 7 -2.152 
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Teemoretticant at Stated level ct significance 
a ee a oe ee ce ee ee ee a a a a a ae a as i ee a a 
os Dental 

Peeiecuoewere Guericd aS te taeir Histcry cf dentel 

care, ix Pargucelenrwroral Ssumeerywadd/or orthodontdl 


memerectm@eGe Iwc grceups resulted vocn whose déeta a Meann- 
Meerdevmmmmest was Perfcrmed to deterrine if any difference 
existed tetween them. Lae en Ween VeChResi iS | cennect he 
Memectc@medue tO the computed test stetistic™(Table XXII). 
Memrredim—mreclOnsS tor the test stetistic included valves 


ETecvermencn (14.E€S and less then €2E&.21. 


ie 





jiy:0-3 tp © Oe! 


ee FECT OF PAST AND/OR PRESENT DENTAL CARE 


pene eae ee eee 4 
DENTAL CARE 
a tae ee ae oe 
| Type of Test Mann-Whitney 
a ——— 7 7 Sani a —" aaaaa  e e 
! Alpha | @.25 ! 
ee ee Ratan aa Oe eee 
ES tao TaAtiStic Ls E72 .5¢ 
2 ae ms I 
Va —<- i eee sae Go = oe eee ee ee 
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ee SP 
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meee, PHY STOLCCICAL CHARACTERISTICS 


1. Hypotheses 


moe  toOllowing byrotheses frertaining to Vd Reseles 


mavevowoetcal characteristics ct voice reccgniticn equipment 


Mero Were tested. 


ae «| Highs 
A 

b H 
H,: 

c an 


Myeeuser «sm, age will net affect nis/her 
mec Como Or accuracy. 


Pemv seme Cot Gem LOUdlume rT Oreos rd LES sc 
Weeks OL evelce recognition equipreéent. 


The height and weight of an individvel 
Usine "yowsee technology will rot aifect 
vera TeCOZNA lich acCuracy. 


ReceenT Cleon ecCOUncCu will NC enfected (oy ean 
individual’s weight. 


Priemiiuals Capacitymend rate of cir flew of 
Qe user whl) DOt atfect his/her recognition 
acc iumaen 
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i (ECON monmcceuracy will The agifected by a 
DeGSOMeSs Niteal capacity » and rete cf eir 
TRO Ws 


GQ. H,: The overel] physical condition of the user 
will not atfect his/her Pecos till i0n 
acOunadey. 


Memehecoemi tion accurdcy will saifected by one s 
bibs reat. condi tion. 


Deoeeecrmal Speech and/cr weaice treinine will 
DCUeetLTeECh TECOERL Ton accuracy. 


ee A user's reccgnition accuracy will te 
effected by any formal sreech or voice 
buat ning/Eneraty. 


Age 


N) 


The sutjects ranged in age frer 2G to 47 and were 
memo ccm~mree bCa Rive frcur;s tor purpeses of the analysis. 


These groups and their m@an Error rates are: 


ie ee te 24 §«6(4.,E8%) 
Mer 22 te fe (7.02%) 
mec tO <1 (7.152) 
Meee este 25 «(5.752 ) 


Ee. Z6+ CGO 


gece ime ercurs were tested t0 detect Yor dittferences 
erone their means. The ottained results (Table XXIII) show 
moceeUiemarr tl 4yYroOtnesis, Stated above, cannot be rejected 
and that the two variables, age and totel error rate, ére 


iuted Layer derende it. 
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TABLAS I LI 


AFFECT ON RECOGNITION ACCURACY DUE TO AGE 
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S. Medet echt andeveiznt 

En ES 
Beye cts ranged iz height ‘trem €@ tc 7? inches. 
Mere erewes were generated for analysis and are listed Telow 


Meeen ther resrective rear error rate. 


inches (&.4€%) 


yy 
Pe 


ae Of to 
b>, 66 toc €9 inches (6.67%) 
merc to fe inches (5.29%) 


jee cae vo 77 inches (7.142) 


The results of the analysis, as summarized in Table XIV, 
mcmecatemmepiat the null hypothesis cannot te rejected. The 
Emel p@emrive correletion ccefficient is not stenificant et 
Phew. @oeeebevel and thus the variatles in question may be 


Poncwerered to te independent. 





Weights of the subjects ranged from 118 to 


£a¢ 


Pounds. seexdMination tor some natural “break” roints in this 


fomee Teavited in the creaticn of the fcllowing tive zrcups 


ed Umieige COrresronding mean error rates. 


ewelie to 125 pounds (6.4&%) 
Melee to 145 povnds (6.252) 


ame 126 tc 175 peunds (5.172%) 
dum 176 %0 199 pounds (7.1EZ) 


e. 2¢¢+ tounds (&.8E2) 


mee nudieny pothesis cannot be rejected, with the correlat 
coefricient indicatixze TECevenCe nice between tac 


Variatles. 


TABLE XAIV 


AFFECT OF HEICHT AND WEIGHT ON RECCGNITION ACCURACY 
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ire sSingslarity in Test Stétistics and correlation 

coefficieénis cf height and weight may be explained ty 

coserving tné correlaticn between height and weight itself. 

A Pearson predvct moment corrélaticn of .821 svezests a 

Streng positive associaticn between the twe variabdleées and 

thus serves to confirm the similar resvlts of the analysis. 
4, Vital Caracity and Rate of Air Flow 


The vi 


et 


ai Camacity of particirfetine subjects ranged 
meen seme tO O25 cuble centimeters. The foitlowine “cur 


Proups Meme creéstéd: 
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A. Poul 1 Orme Cubic céntimeters 


ae Peo towerC?’ cubic centireters 
Oe cdc | 6UteM45O cubic centireters 


at 
aS 
NJ 
un 


mn MA65S to cutic centimreters 


Analvsis for differences between the means of the vericus 
groups generated the test statistic (Table XXV) that 
Zh ve@mm no une re‘ectior cf the null hypcethesis. A 
wounelagmon het~ween increased Vitel capacity and low error 
mec Weer. CUnRd to be significant using a cne-tail test ‘fer 
mereviverrorre!ation (critical level: .g@45). 

Me rate of airtlow characteristic had a range of 
Eoeeeetommeyecl liters per minute. this range wes diviied by 
moe dideepne tollowing groups were usé€éd for the analysis. 


moe feereincluded: 


Ae 





me 2ie tO 657 Liters/mrin 

te OS< te 460 Lliters/min 

c. 4€1 to 599 liters/rin 

dq. 6¢¢+ liters/min 
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moe dilemma of a non-significant Kruskal-Wallis test 
Bmgeeecd Semenificant correlation coefficient can only be 
Sopeeim@emmcy see SubIective divisivon of the rang2® of flow 
rates into the grcups used fcr the analysis. Riased 
Eeouuing = could preyvide a matrix that would yield a 
Sienitficayt test statistic te shew a ditference tetween 
Means OU in the final analysis, creditility for this 
Ciemmdevermsric aS a deterrinant in rersocnnel selection would 
temL cs te 

oom sical Condition 

ieee £65OursS resultec frer tke <ubjects’ selt- 
euro aieeot stheir general chysical condition erd include 
Categories ot tair/roor, averag2, egc0o0d and ovtstanding 
miave ically conditicr. Their tctal error raies were examined 
omc evemmeme if e ditferenc® between the groups erlstec. 
The results presented in Table XZAVI de net alicw us te 
memectUmoene OUulL hypotnesis. Bcc ti oleie | a . neglizitie 
Poerolmmmen cceizicient presumes the twe variables te te 
independent of one another. 

PeeenOUen a SUDJECTIVEe response was the determicant 
Pet weememenardcLerictic, seven subjects who hed colds, 
Peened@mmee reccenizer. Their condition was such, that e 
Mee tinG@eerdsality was present «while they ¢pcke. A Mann- 
Weevoevetest was Eerfor@med to determine it a difference 
between the healthy and ‘cold’ groups existed. The test 


arastumem of iTable XXXVI ftirther verifies our previous 
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conclusion, CeMetteecannet te pejected. The criticel 
regiors for the Mann-Whitney test correspond to values 
greater than €Sé.&6 and less than 771.4 

Med iiy, the analvsis for affect due tc formal 
speech Beerdaty Or volce training resvlted in a test 
Statistic that would fct allow for the rejection of the null 
bpeetRecm, thet Sceech therapy cr voice training will net 
Sere ctedmerser S réecoOsenition accuracy. Gritiecl! © recsions 


corresponded tc values greeter than E€S@5& and less than €9&. 
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AFFFCT CN RECCGNITION ACUURACY DUE TC PFYSICAL CONDITION 
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Seer. GOLOCTCAL CHARACTERISTICS 
1. Eypotheses 
ae to: s wuxiety will not attect the recognition 
dCeuradcy O©2- 5a weer. 
HY: Anxiety will affect the total error rate of 


amuse r. 


Gre £,: ihe cocperativeness cf a speaker will net 
Suecweurs, Hera toatl error rate. 


speaker coorerativeness will atfect 
mMeecoean Lion accuracy. 


esos tne Occurrence of reccenition errors wiil 
Momma t ts eCUrOveTal., TeECOSnitaon accuracy. 


ri 


A speaker’s overall error rate will te 
affected by the rpsychclegical intluence cf 
ris- and ncecn-recognitions. 


Mine fo speaker s belie?’s in voice technology as 
ev Oem oc Ue D Becldueewilll Net effeer 
Fecoeni Ticks accuracy. 


Hees | Neeatti tide everson possesses toward the 
influence of voice cn a computer overator’s 
Tomei welche w rh inenesss tO USE VOlGe 
because Oto this ein iverce@Mill arvfect 
RECOLNItION accurecy. 


e Hj: The attitude a speaxer has about computers 
aud nr ormation  rreccessing will heve neo 
ESyeneloFfical diene Gt on RECOL fon 
accuracy. 

Bo: A Speeker’s psychological attitude 


GCOnecerminge “auTOmatica and data processing 
Well aereceu TeCce Nl peor accuracy « 


oe Mesycoological Anxiety 
jee results of the State-Trait Anxiety Inventory are 


Merpecte@meraphically in Figures 24 to 26. Figures ¢4 and 25 


124 





show some indication that individvals with a lcewer state 
anxiety acquired fewer errors. The relationship tetween 
Sweon mauve dnd trait anxiety, shown in Fiegvre 2€, dericts a 
more randomized cccurrence of’ errer rates. Correlaticn 
analysis substantiates this in that State anxiety dvring 
ween tims Statistically significart with week #2 showing 
Bome Pesci ve eee ven DUG eter Sion t 1 ComG cb CGE mG 
Mewetvmimere 15 nO Significent positive correlation between 
iweb eNerety end errcr rates. 

The otteinea STAT SCOTES yielded a normrél 
Gistributicn and e€cual sample sizes of high and low anxiety 
[econ mewn the basic assurptions for use of a perametric 
Pave tems LwC Safwrple t-test was used to detect differences 
femween Farcups. Giese Oni del) Ce me Ol diva r 1 Cant dit 
whitney test was appiied for EUPECSES OF firther 
weritication, ncwever it does nct pessess the vower cf its 
paremetric counterpart. Results of the analysis ére 
included in Tatle XXVIII. 

la ail cases using non-parametric enalysis the null 
Meaaunesmss Cannot be rejected, altnough the criticel level 
remo udtemuest Statistic to be just Within the acceptance 
ee 24.0 Ni. mye wedeeOOLOMY in the trait enxiety analysis is 
ieverest@me, the more powertul paranretric test allows the 


meomiee wimmmee: tne null hypcthesis woaereas the cpocsite exists 
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Figure 25. Mean Zrrer Rate vs. State Anxiety (Neek #2) 
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Mewes co. Mean Srror Rate vs. Trait Anxiety 


rte tieerant=ehitney. In both instances thougb, the tést 
mers blGmmme.es Extremely clese to that peins separating the 
meceobencesend critical rezions. 

Tee t recy Gue tO anxiety Way be comsidered as 


inccnclusive tecause of the resultant stetictctiical analysis. 


Cy 


Peeenecued Sneowilne significant correlation in ‘¥eek 721, 3&ny 
anxiety in Week #2 may have teen cvercemé or matxed dy 
fevevetatiwoeeaga Gxceriénce with fcuilcmedt and procédures. 
eee Choe «hCard «6CU6t Re «administration cf the Preit inventory, 
SUCJECtTS were theroughly verséd io THe exvrerimental 
meeccedure. The inccnsistent results nevertheless, leave 
Por mnempe  16¢Ve that anxiety Has an aftSct on speech and 
Memce  ceeaeniticn accuracy, but the dezree to which it dees 


rereins a cleuaed issue. 
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S. Goreeaker Cocperativeness 





subjects evaluated their degree of cooperativeness 
on an interval scale with subsequert creation ct the 


MOL OW lneme rOurs. 


a. Less than cooperative sreérers 

b. Moderately ccoperative spéakers 

c. Very cooperative speakers 

qd. Extremely cccperative speakers (subjects who 


marked the “anchor point’ of the scale) 


miewresaers ct the analysis are presented in Table XXVIII. 
wenoeimeemeoerrc? rates graphically rerresentéed in Figure <7. 
The nuli hypethesis is rejected due to a test statistic 
meee cum~meenwe: the Chi-squereé value of 7.815. Multiple 
Bompaticcms amcre the grcups reflect an existent difference 
bE TWEEN the “1ess than cooperative’ and ‘extremely 
cooperative speakers only. Despite ixndicati¢n otf some 
correlaticn between high cooperativeness and low error rate, 
boc, COtpmrecd ccefficient is not significant ata .@& level 
Poriticeummencvel: @.095}. 

Teese results led to a further anaiysis trem a4 
perspective of speaker participaticn. havea S did t xe 
awe cwmmeec Carticipating in this tyre of experimentation 
aoe smc CcOULd it the correlated to tetal error rate? 
imeem seme culve responses resulted in the creatior of three 


Pemericeeroups as follcws: 


129 








MEAN & 
ERROR 
RATE 


ae es ee ee ee oes es ee ee = 


Nn 


fA 


Nj 


a — ae ews ae & ae | ear amp @e aw @ 2 a= as a I om eee @e ew GE ee a= ae a i oe er ae ew eee 6 Oe ee 2 oo I oe Ge eee eee oe = oe 


Pigzgure ge. 


~~“ 


Mean Nrror Rate vs. 


18 6) 


cc 


a ad 


(6.43) 


| l 
Less Moderate 


TABLE XXVIII 


(Ener) 


speaker Cooperativeness 


AFEEFCT CF SPEAKER COOPERATION AND PARTICIPATION 
to—-———- —-—------ - tome e —-—-—— — —- — +—-—-—— —— - + 
| COCPERATIVENESS PARTICIPATICN 
}—--—-—-~-~-~—-~---- +—-—- -—— +e — 
Tle pe for test. | Kruskal-Wellis | Kruskal-Wallis 
oe oc a ine Ie yh 
| ] 
Alcha | cs | 5 
| pee +-—- —-- — +—ee ee 
Test 
Staumctic | 16.82 => : 4.76 | 
1 —-—-—-——————————— +—-—— --—- tee 
Cr istkec 1 | ; 
Level < .Oe5 , @95 
poo oce enn +e ee $e eee ee 
| Correlation | 1 1 
| Coerriement | —-.22E +. 27E ** 
pooce-oo- $—-—- -—— = — —- +—-————— —- ~~ - — — 
** = Significant at stated level of significance ; 
toe eee ee  - e - 
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a. Those who don’t care 


b. Persons who like to participate 

Come etLersons who strongly like to perticipete 
In this instance the attainment cf a rpositive correlation 
indicating that those whe liked tc participate acquire 
meee r Ceror Fates 15 counter=-inhtuitive. The null cannot be 
rejected based on the computed test statistic given in Tahle 
XXVIII. A correlation of .€36 tetween subject respenses to 
cocperativeness and oparticipation iS ret as Large as was 
exmeCctedsana 2S such could, in fart, have led to the 
divergent results. Whether these results are due te willing 
Meisaicimgemts trying too hard to perform ewell and thus, 
iarrineeereate> than wusval mis- cr nor-ré€ccgniticns is 
unclear. 

So eecoenition tmrors 

mene ChsS heSpendea LE EWO GQUESTIOLS, OnE pertaining 
PomecCicwmumeerecelings at Tae time of a mis=recognition and the 
femier Vemudainame to their feelings cver a ren-recogniticn 
(beep). Their responses tc these two questions were 
averaged tc rerresent how they felt teward the occurrence of 
Soe chemmecnod this led tO the creetior of two aistinct 
groups; those who don’t like an error to cccur and those ho 
Bee Gmevye are ~mOL disturbed cr tothered by an error. Tthe 


restits of the analysis are summarized in Table XXIX. 





TABLE XXX 


AFFECT OF RECOGNITICN FERCRS 
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| Critical Level . 2897 
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The null hypothesis cannot te rejected eéand althovgh the 
meeautwemmmecoOUrhelaLion eoeetficlent indicates that those whe 
Seeculimwemer ners tend ta Raye higher Error retes, it is net 
Sesrittemmm= at an alpha of .@& (Criticel Level: .27). 
5. Meuse tudes Toward the Use c? Vcice 

Buestions 4, 6 and & of User Cuestionnejire #2 were 
used to measure the speaker’s attitudes toward voice 
technology. The results (Table oe! indicate a 
cutiStewmeamiy es Sleniticant correlation between high error 
meres aqua -avOrdble attitwde tcmard vycice recogniticn as 32 
Teens Mom csaving tire and reducine the burden on a comrruter 
PweECCOMm——mmOGaL ber CLOUCS Of responses Te these questions and 
asscciated error rates are depicted in Figures ?&-c¢. 
Multiple ccmparisons between the grours chewed differences 


between these whc would always use vcice and those whe werld 
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seldom use it despite its Dronouncéd advantages, and between 
those who frelt that the advantazes of voice will give the 
Keyboard operator other i058 and those who disagree with 
mem) aumemui1 luce. “~leeretore, the null hypothesis cannot ve 
BemecteCd sla Lerms cf a speaker ¢ ae ucee cencernine the 


? 


inflrence on a data BrocessoOr 


i 


Voumecee FO. VOlve 


ry 
(D 


cognition. Cn the other hana, a stearer’s willingness to 


‘es 


in 


e veitce recogniticn oé€cause of his/her beliefs in its 
Requisite dqdvyanteges will artect errcr rates. 

As was noted earlier, the presence of a positive 
COmeclctmecn —emears to be cOrtrery to popular beliéf. Cne 
would imagine that a user whe believes voice reccgniticn can 


Take the jot of a computer operatcr easier (Question #4}, 


1 ‘CG 
adn et? ee 





Mcvtc temas tCward better recogniticn accuracy. Questicns 
Six and Eight were esked for the purpose of determining if 
a user’s error rate might be influenced by the subconscious 
Preven tiuemcor, E€ncurtering additional eduties ~becavse of the 
efficiency and effectiveness of voice irput. Bvt, despite 
the possibility cf additional tasks, potential users still 
would prefer voice to manval entry. However, the trresence 
ewe ciemericartepecsitive cerreleticn ray cnly he attributed 
POrmee UNMegueness of the situation; f€. as in speerer 
PombicCipemmsen Subjects who pretessed a strong desire to use 
vOlce regardless of consequences mey have tried too hard for 
moe di@eeeeracy dnd aS a result have failed to sreak ina 
Saeturdtmmanner . 
E. Attitude Toward Cornuters and Informaticn Processing 
In resfonse tc two sets cr? questions, subjects 
provided bier Settitudes surrounding thes necessity of 
POmrulLehtomin tOdays SOcCietTyY dna how voice technology would 
ete on Precessing c€r data Input. Attitudes towards 
Comeutememwrell into three general cetegoriec. 
Pewee erSonS who zte€el Ccmputers are unnecessary. 
[ee rsons thats teel computers ere necessary in 
SCeclety, CULlsadmeraOtCea Panacea Por ali problems. 
Gees inose who teel that computers are an absolrte 
mMecess ily. 
AUCGL tates toward voice recognition and information 


Puocessimemresulted in tour Categories. 


1SE 





a. These pvelieving that voice would take mere tire 
for information or data processing. 
De inose with no orinion. 
c. Those who feel voice will save sore tire 
d. Those who feel voice can save immeasurable tire 
compared tc conventicnal methcds cf data ertry 
and eee enor EP EOeGes Sie. 
Results of the analysis are sutmarized in Tatie Xxkzi. Besed 
PRU CesemeRe SULTS, the null hypothesis cannot be rejected ard 
mis, luenoy ctercOnuCinudeGd that the crinion or attitude a 
Merson ~pessesses tcwards ccmputers, anc their feelings 
pertaining te voice aS a tire seving advantage wii} not 


meee cClmumers TeCccegnition accuracy. 


TAB LAAT 


AFFECT DUE TO ATTITUDES TCWARD COMPUTERS 
AND DATA PROCESSING 
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feectateweric | .7e | Se : 
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eoriteedl | ! : 
| Level yee 7 215 
| ----- --- - - - -- 4+ meee ee + meee : 
Correlation | | 
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| “% = Signiticant at stated level of significance 
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G. VOCAEFULARY ERRORS 

As a result of using different nurbers of syllatles in 
Doe VOCameuacary, it WaS also pessibdle to get an indicaticn of 
how well utterences with aifferent rnurbders of svllables were 
recognized. UEliginaviyesceme im a Lonzsitudinal study [Ret. 
Bee pp eoeee| it is enalyzed “within the context of this 
mooi Cim~es MNiunyner Veraitication Of these earlier results. 
This is shewn ty weeks in figtre $1 and cver all conditions 
Pie i2uGemec. Both tigures illustrate e generally declining 
Brewer eceumemasS © flinction of the numbes cr syllables in the 
lieGerance. Plthovgn the current excerimnentation yielded an 
ee cktmurery 1.5 percent rise in e@rrcr sate @rcem three ic 
moun syumeemelOS, 1% 15 NOt a large qeviction feom the earlier 
Srudy When indicated little change in errcr rates between 
Baree OQumeeur S7Villebies words. 

[no erewmas or overall SECC PEVeness , é eb Ce ae 
Soelicawmuem “@wculd dictate the leest e@mcrnt cf recoeniticn 
PTR Orc mene rerore, an Error rate of &.9124 still reéerainos twe 
to three percent better than wtterances with @ smaller 
veered dmemeantent. Desrite the higher rate for four Same 
Compered™ to five <¢ylleble words, the aifference is still 
Pesce tcemmerdat Of ONE LO four Or twe to four sy!ilables. The 
Variety cf vecabulary items used in this experiment frrther 
Coun iMniecmmene Argument that through a careful énd judicious 
Selec wwemme. vccatulary items, large vecabtlary difficulties 
and associated high Error rates rey be reduced. 
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Figure 1. Mean Frror Rate vs. # Syllables (by Week} 
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VI. CONCLUSICNS 


Pollowine the lengtny eClaberation of results in the 
Peeve meucmmseccCction itswculd be helpful te recapitulate, in a 
poet cWmmary fOnmM, the responses of the different variables 
tested. Variatles ean inc in a statistically significant 


test statistic included: 


== Me@mmem OF training 

-— experience Of the user 

pate mewemons CONMTUTeCr experience 

-- lewemect Educaticn (all subjects, 
—— Vitvemeecaracity 


ee Seer CCCberativeness 


The following verijabdles Eroduced é Slenificent 


COrrélatiom petween itselY ane reccgnition error rate. 


=~ Prewma.s computer Experience 

—— hitem=er the week 

=> sex bemverce of the user 

=— fevem, of education (é€]1 subjects, 
== cfeower tarticipation 

=o V Wecmeecapacity 

=~ Nomemot air tlow 

=—- State anxiety (first week only) 


—— USE€y attitudes pertaining to voice 
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The following variables resulted in either a non- 


Ceo t weamr: Lest Statistic and/or carrelaticn coefficient. 


JOterunectlion 

Brawen af service 

JoOuesatisfaction 

pete ce Satistaction 

Foreign language competency 

Tire of day 

Time ot week (test statistic only) 


Semmes uUSe of Veice eCaulpment 


tJ 


Level of Education (naive users) 
SOeso-cconomic class 

Lema vere 

Race 

idioma Status and family size 
Hewmesous Creference 

a Cee. 

Pheace c* birtn/gecgraphic crigin 
Age 

Height and weight 

Romero cirflow (test stetistic) 
Physical conditionineg/speech trainize 
Anxiety: State and Trait 


Sreaker ccorerativeness (correlétion) 


Speaker participation (test statistic) 





prime cu, Of Tecofnition errors 


== Atumeudes toward computers/datea rrocessineg 


lime srane@e in verrer rates, -2&@ tc 15.7 percent, for 
the individtal svbjects (See Arrpendix J for a complete 
surmary) indicates an obvious variability between subjects. 
miata neeeme  CONTEXt Of the main experiment and tae associated 
POOVAR tme three variatles of job function, training metned, 
and experience (triels), are irdependent events and ars 
mretvecteq, ircm contfeatunding dve to the eéxperimrentel design. 
The selection of a level of significance €qual to .@2 is 
Terme lveswamewew a pessible existence of some effect, rot to 
Mmemonsttmave a Tligdrous test Of 4 stated hypothesis. As the 
eat yolemmeresresses to the extracticn of nurerous cther 
ia@urmen Meerevers, these protections end the eccompanvineg tower 
fied PpakcmeTriC TeSt are reduced. miso eCes r1Sbences cr 
poe Meermmor dad COSSible dependence between conaiticns 15 
Mewessamemerior tO ré€éaching dn wultirate conclusion. 7: Og 
exemple, were those subsets of a Category achieting 
etetlStimca: sivgniticance also trained with supervision 
and/cr experienced users and if sc, how many were in that 
Darticulam SudDSet? 

The results presented herein suggest that speaker 
VonheDimmemaswOould NOC atfeet recornition accuracy to such én 
EMEC OmeTeCIude its USE among cHoly srecially selected 
users, Tamer Tone Men TavleteaslhaGiiitary applicaticns, this 


Breve scum~gmuc E€cpecially sdtistying since it.wovld negate the 


ad 
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Sacer rOm the necessity of classifying persannel intc 
particul ag military occupational specialties or 
subspecialties BO Ue  Extress E~urrose of Operating voice 
equipment. It is apparent trom the experimentation, and tne 
diversity ofr skills ana experience contained within the 
scarple population, that practically anycné may ne & 
potential candidate to operate voice recogniticn equipment. 

The thrase ‘practically anyone” should be cualitied 
rere. Intersreeker veriability hea a significant impact ix 
Meeweadse Of One subject, who possessed a severe sreech 
impairment, etutwmering. It becare obviots in the early 
Saeco omemmmtrainving that he would te unable to finish the 
training phase. [tie tet. set ver Loc minutes, only il 
Miecerdmees hed been satisfactorily placed into memory. 
PeercUsSuee tne Mwpcdividual was eliminated as an experimental 
mmectmmn lS sditficultye @eronstretess that althevgh Post 
Grepgeue weeadn use this tyre of technology, there will alweys 
Gramet tmesce, albeit few im number, who tor one erceptiocn or 
emether ere unebtvle to attain a Svuitaebie level of reéccgniticn 
accuracy. 

THe Securrent experimentation has clearly shcwn that 


ipjime Swelce ecavipment can 


ov 


SExeericewee and mrethoa of tr 
orev wee wer ceilent recognitiom accmracy rates. Otay COUrse. 
what determines an “excellent” rate is fEurely subjective and 
Cepeommrdeate Urcn the application in which enrplacédc. aia t 


Makes this otservaticn readily arpealing is that Ddctk 








GQ@Meatactenicstics are controlled by the huran. They are f0t 


Tactors thet one is born with cr has inherited. Rather, 


Wweeth Closely supervised training Crocedur 


(13 


on Cy an 
experienced operatcr, a ‘naive’ user cen auickly attein 
recoenitwem rates esreater than 9Y95 Rercens and with 
Meme orve —ExXPerience dnereese this accurecy until errecrs 
meeerednerea Fe Less than two percent. It wusSm be reitereted 
that in the present experiment, subjects were net allcwed tc 
retrain the reccgnizer during the three weexs of recognitior 
testing. ite ademmality, the sreaker wculc retrein an 
Peematm@eemreatrer than tc continue incurring mis- cor fnon- 
Pacer ninemeem. Errors. 

To a Lesser degree, speaker cccperativeness and amount 
Dime revereus COMeEuter experience sare definitely factors ta be 
SOusidered. The latter charactéristic influences Fie 
Pomme SELECTICON  ~precessS ;yhile speaker cocperativeness, 
Poe iremning and Experience, can te influerced by, the huran 
See en te emuandeey, OeCAUSe Cr data Procescsaage experlence, 
Such individuals can readily identify with the édvantages of 
epeech input and thereby técome a more or hiehiy cooperative 


Emeenme camer us combined, these two factors Stronely support 


Piemeomemrral fOr achieving high reccenition accuracy. 


The presence of occasicnal positive COrrel ation 
COEITicmenic, acpmuew CT Cimee Teas tl Caley ees) enitficant ~~. are 
feewewlt to Explain or resolve cemewnic ive lv. Sieg 


Miwadccemecs 12VE1l Of Pperticiraticn, desire to use voice, 


merteeorveareaitt " 





Piedtitmmcaes pertaining te voice, provided misleading 
mesul ts. Tt was surmised that Speakers who ere willing 
pamuilGiveamts and tind voice to Wer a technology that they 
would likely use, would achieve low error rates. The 
observation to the contrary, suproses that rany of those 
speakers tried tceco hard for perfect recognition accurecy, 
Gmemas a@mresult, were less éf¢t tc syeak naturally. Nos! 
ErieeCt,ueemey were trying tO outsSrert the rachine. 

Thus, in ac operational Environrent it becores incurtent 
meocneeetGmee CNEe Speaker end the supervisor to fully embrace 
DaemcOteeets OL VOiICe technclogy for use in a tracticel 
apie i Ccayeren. In demenstraticns at the Naval Pestagradvete 
Semool feris frequently noted that Olhservers are genuinely 
mime cseamntith the capabilities of veice input cf data until 
thet one Error, soretimres after rore then 2@0 successfrlly 
Roeeocthmecm ULTLerances, @ecCcurs and they sit bacr and rerark 
thet perhaps additional research is needed tricr te placing 
it into operationai use. it is obvious that voice 
technology is acceptable for use in a militerv commend 
Cemrer wcna Must VE PUully subppoe.ed by the Commander and his 
See tweet 1S, Error retes Caen “be Winiwized by huran 
Comeuorcmemctcn aS trainifg and exrerience. In conclusion, 
consistency may best describe the key Lo speaker 
Vapia bane y . Atel bmees , Mimaluime, aud experience together, 
Pm cememmcIStTeEnCy im Speech ard “consistency generates «6 


Comrriwemmnlenh recc2enition accuracy rate. 
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APPENDIX A 
USER CUESTIONNAIRE #1 


NAME: SUBJECTS: 


——e we ee ee ae ee ee ee eee eee ee eee ee eee ee ee ee eee eee eee ee — eee eee SS ee ee ee a 


Pee URR@eese sol UNiS Guestionaire is to obtain inforration 
Eo YOu moeaeimne Chysical “therecteristics, personal 
background, and opinions pertaining t0 voice recognition 
ecvirpment and its use. Your answers will assist in 
Se tCeompeete es aethese rersonmal and/or frhysiologicel traits 
SCOVnUrmem- te effective utilizaticn of voice recognition 


ec) pment. 


mene  ecmeogc ercitid® multisnle choice, YES/NC, reting s¢cele 
end short answer fone or two words ONLY!) types. 
Perot mem- es dance accompanies @€a@ch question or block of 


vec t 1 oes. 


Your neme is NCT required but iS requested in order to ease 
Poewereg@eescer’ correleticn @: your revlies with ycur results 
in the Exrerimentetion. If you desire anonyrity, pleese 
PeSPOuOmm~—~m~r tn. your SUBDJECT nurber only. Please respond 


truthfully. Check yovr questionaire efter completion tec 


insure you ve completed all the questions. 
Pioneer Or your aSsistence in this experiment. 
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Pipaguestimons 1 - <<z, Roveaes ClU Der a one or two word 
mesponsemm™enh piace an X by the apprepriate answer. 


i. .wepeis yeur age? 


oe Mae S your height (in inches)? 


—Eeee eee ae ee eee ee OE ee 


en Wheweers ycur weight? 


4, What 1S yceur race? 


White (Ceucasian) 


Yellow (Asian/Mongcloid) 


Black (Negroid/African) 


Rea (American Indian) 


m. What is yeur nationality? 


Native Citizen of tne United States 


om ee Oh ee of 


NWavuralized Citizen of the United States 


Aeley 


Wiemmes your religious preference? _ =... 
(See Attaechea Sheet) 


Q) 


*. WRAL is your ethnic baccground? 


Peertroe Rican 


Peel cl no 


Mexican 


Cuben 


ee. Latin American (versons frem Central or S. America} 
Other Hispanic Descent (Extrection not delineeted 
as Mexican, Puerto Rican, Cuban or Latin American) 





SKimro 


as ee = oe oe 


Alevt 


Tndilan 


Melanesian 


Chinese 


Japanese 


Korear 


oanwe= ow == «= 


Folyresian 


Vietnerese 


Cther Asian Descent (Fxtraction not delineated as 


—en_eas = a 


Cempese, Japencse, Korear, Indian, filipino, or 
Vietramese) 


Nene of the “bdcve 


Cther (Please specity 


em ers ae = —ew ee ew es sf ee eee oe ee ee ee ee eee eee lee 


©. Do vyeu have an accent? 
ae Meomiwhet <igmd?  ...._....._} 
NO - 


UY. e8het is your Marital Status 


Married 


Dasworced 


Single 


awe @e == «= 


Cther (separated, widowed) 


me. “HeOwemeny children®do you have? 


ew Gwen = = 
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— oe or = of 


11. Do you wear glasses? 


WZ. Have you ever had orthodontist care &/or wear/wort 
braces? 


12. What is your Level ot educaticn? 


Non High Schocl Graduate 


Hish School Graduete 


Asscciate Degree 


ame er eo o- oe 


feavedmec: CCllere 


<e yeers of college 


Seevedrs O01 college 


4 years ct cclleve (no deeree) 


College greduate (BA/BS) 


Gradvate work cf more than 1 year (no degree} 


Masters Degree received 


mae = a oe 


Doctorate Degree received 


14. Whet state were you born in’? 


aes ae TF ee ™ ae Pe ee awe se awe Se ew ee awe = a 


Doe Jumene aees i-12, in what <¢tate did yor princidally 
mest ae: 


ower e- owes erw = awe aw ew oe ow == e~rwrewewew a= ow ew a= a 


Teo 





Peeerame NaS been your State of residence for the mejority 
of the last three years? 


I ne el 


17. Do yeu speak arv foreign language(s)? 


Y=S [which one(s) ] 


—_— — =— = — me eee eee eee ee ee 


meee Weetm.S Your brarch of service? 


Navy 


Marine Corps 


—_—e ie SF on 


Air Force 


—_— es ae e= «= 


Cther (civiliar) 


—weE == == = 


19. How meny years heve you been ir the service? 


=~ sew ©: OFS ae e == 


ZO. Have you ever creen overseas for mere than 1S 
EOmeeccutive months? (not including leeve or vacavion: 


YES (ge to aquesticn #21) 


NO (go to question #22) 


Ete HOwemany fronths were you overseas? 


—_en oe ee Oo oe Oe =o 


iTyew oat ccuntry? 


—_— ae oe ee ee eee ie ee Oe ee 


Poorer Oo 7CU CCHOSider to te your socioeconomic class? 


Lower Class 


Upper Lower Class 


Iewer Middle Class 


Middle Class 


Upeerariadile Class 


}~ 
(y 
~~ 
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ee How wou 


d Onde Gite RECoeML LION Eq Lemen v area 
infermeticn e 


Seer rnaava Tnpucs 


eee ee eee ane Ss se SS = a Ge meer Pe a ee ee SS ee ee ee ee ee a = Ge ee ae ea ee ee ee eee a ee eee = Eber aes ae ee eee ee ee 


. A LOT SAVE SOME NO OPINION TAKSS MCRE TAKES A LCT 
ME ena DONT KNC¥ TIME MOR Eire 


Pome lievemece fecosgition can seve time. it wovld allow a 
moacOds Ome >eTa. Or we dc other jocts. 


=— ame ee abe ae oe eee a ee ee eee oS _—_e es a ee ee eee ee ee ee ee oe ee =— a ae ee ae ee ee ee ee ae a ee a ee es a ee ee ee eee ee a 


PaACIDEDLY SaeGRITLY NG CPINICN Se) Gaui) DECIDEDLY 
De ES ACREE CONT KNCt DISAGREE LISAGRES 


ree, reece FOE use Of VOlCEe reccsnition equirrent. 


i { 1 


_—-— see Ss ae -—- a ee ae ee —_—_——_—- oT ee ee ee eee ee ae oe -_ oe ie ee ie Se i — Ee ae awe ae ee Ses ee Se 


i ! J | 


VERY EASY ZASY TC NO OPEnicn DEREICULT VERY 
poe = Uses Le US = DLE cw es 
DO? Usk 

Powe JCu thiuk Of voice receznition equiftrent tor 
se tie vary Commana Centers? 

— eee ee I ae rene ar | Oy a ee ad es 

J | { i \ 
ee Y SCr ny HAT Wwe, CFrINICH SOMEWHAT ar 
Ban 1 LOR Paee. ICAL DON’T KNCW Hee CI CAL IMPRACZI Cre 


owei@mmemme nm Lrevlous COMputer Experleuce have you had? 


— a ee ee oe ae ae ae ame ae er oe = a -_— oe ae oe ee oe ee ee ee ee = oe ee ee ee Oe eee ee ee ee - <a Ge abe OE ee ee Se Ge ee 


aLOT CF CONSIDERABLE SCVE 'Bo 
eee: Sees, = EAPERT ANCE PAPERIENCE EXPER 





Cee wenicemmees YCUr Lrevlious Experience with voice reccgnition 
eqvripment? 


VERY “MUCH VUCH SOME A LITTLE NONE 


Eo) GOeeewould additionel e€xperiénrce with voice recegnition 
emi pmem@peaitect recognition accuracy? 


—_ ee ee ee eee ee ee ie ee —_—e ae ee oF am ee es em ew se a= 2 =~ eee am oc am Se a= ow Oh eo ew a = _— oe oe eee oe oe oe ee Se oe 


‘UCH oCMz NO CPINICN A LITTLE NC 
INPROVEMENT IN FROVEMENT PPG y asl Taree ee 


Sere Owe yOu feel when @ MiSTrEcoegnition occurs? 


S'RCNGLY k= , NIUTRAL Peo Sia Cita 
ies 2 Diels 


fee Lowego you feel when ¢ non-recognition (“beep”) occurs? 


—_—om ae a ee oe ewe a ee oe ee eo ee oe _— = oe ee ee ee ee oe ee oe —_— SB ampanm s= = =m = am ewe =a ae ae == — Ge ae eee cee eee eee eee a ae ee a ae 


CNGLY Die NEUTRAL eo ae STRCNGLY 
x Die 


eer 2oweego you feel when @ recogniticn occurs? 


STRCNGL? LIK? NEUTRAL DISLIK:# SI CNG ry 
Liss VIsilas 





Dome 2cSerrce yOUr Participetion in this experinrent. 


EXTREMELY § MCDERATELY COOPERATIVE SOMEWHAT VERY 
COCFERATIVE UCCPERATIVE UNCCCPERATIVE UNCCOP- 
ERATIVE 


SoemecOwmecuid yOu deScrite your participating in this tyre 
Cee SP eT tee Utallion: 


=— oe ee we oe oe Oe eee oe oe oe ee oe —aw ee awe WEP om aw awe =& oF ae a> — a @— ee ae ae oe a ee ee ow ae oe 


ST RCNGLY IIka NEUTRAL DLS ee Selene iG Lat 
LIK: Vio eke 


cveeevmctmers HOUT CUrrent physical cendition? 


ape =e Gp ewe Gee WP Gee oO we eG eo oe ee oe =e Pe aw SE awe oe aw Se ws = a == ~—— oe ae cee cee Gee a PP ewe we Ge ee ee aoe one Ge ome eee Gee Gee ow eee SS oe Seo oe oe 


CUTSTANLDTING GCCD AVERAGE FAIR POOR 


Sree lo emeerce TeCoeniticn does Save tire and allows YOU tc 
be essigned other tasks, how Griten woulda YCU went to use it? 


ELWAYS FREQUENTLY NOW ANZ THEN SELDOM NEVER 





AY Paige A ce 


Usik QUESTICNNAIRE #2 


NAME: SUBJECT#: 


~~ cee ee ee ee ieee ee ee ee eee eee ei ee oe — ——e ee ee 2 eee 8 ee eo eee 


Pole’ CI Gis: 
DMemolimmeremct this questionaire isc t0 obtain information 


irom you Pesemeine veysicel therecteristics, personel 


> 


in 


Poe romeo Orinvenus pertaining +0 voice ré€cognition 


EQVLUILMENT ang its uS€E. MOUrE Jca@swers will SeSsis% iin 
SeC-Th ieee mee mere ,eore@nel and/cr thysicelogical traits 
memer i bCuverE tc meective itilizeaticn Gor Voice recegniticn 


Ed vi prentT. 


mee GUSomnonksS LOCLLGe gulmizy le ChOiCeGm YES/NC, ratime <calg 
end Spalseet! exswer ‘ore CemeiwO words ONLI!) tyres. 
Peewee Pome Uld a0Ce acCCORTdnies E€ech question or clock ot 


ies t1cmee 


tcur mame is NCT required ort is requested in order to ease 
Pecmeitie@oreeary Ccrpeletiog cf VYeour replies with ycur results 
PeeeeeCewceerecrcmentetion. If yov desire anonyrity, pleese 
Beeeema Meth Cur SUBJECT number cnly. Please respcnd 
TOUtdrwety. Check your aquestieneire efter completion to 


Miiewre Mea ve contleted ali the questions. 


tae -YOUNs Or your eSsistence in this exrerinent. 
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Memerestmons 1 - S$, previce EG1ther a one or two wora 
response, cr place an “X° by the apprcpriate answer. 


Pec cvemmroOune ver fad One Or; Tore or the following speech 
Miemecdifeaos ald/oOr imrpairnénts? 


Mob cCulelicCr (daar 1Culty ir proneuncing vewels 
ana/or conscnants) 


— STP ee oe ee 


Yeice (irregularities in the larvwnx) 


Cieft lip ana/or lip palate 


temeteal palsy 


Houtvverine 


Hearing impairments 


Aphnasia 


Moncemmsal <peech derects (hue tee birth/préenency) 


Retardation 


None of the abdove 


Pero vemmeormever Teceivea sEpeech UWrerapy from either a 
Peesidgmacamen sree) clinic, private speech therapist, cr 
parouehwpoe putlic school system? 


5. keve you ever réceivedad voice tréining or taken Singin 
lessons? 
= lens (Hew many years? __—_—s») 
NC 


led 





ecqeesemn>s + - 1S place an X on a point on the scale 
that best indicates or describes your feelings. The xX may 
be placed anywhere élong the scale. 


4. hcew woula voice recognition make a computer operator’s 


— 2 oe Oe e- Oe awe 6 oe ee Se ae ee | we 8 C2 awe ss Ce ev wee a eS ee oe a= C= oe ea ee 6) ee ee ae 2 eee = Ome oe: ae ee c= Cee C= aw & o= a= 
‘ 


MUCH SCVEWH AT NC CPINICN Mota MUCE MCRE 
BASTIER EASI{EH DIV PTCURT DIFFICULT 
a, How WO voice iselaysinlil a itona equipment ei fect 


Pmievorhdauileem EYrCccessifg or data input? 


o~— ee owe owe ew Se ee we ee ee eS Se 


SAV2 A LCT SAVE SCMY NQ OPINICN TAKES MORE TAKES A LCT 
Ob TIME TIME DGN“T KNCW eles MORE TIME 


a. Mieperrece.-COf%i4100 Cab save tire, it would allow a4 
meycucdrdmmemerdtcr 4o €Go ovher jcods. 


o~ a= oe Se Oe ee oe Se Ge ew ese Se = ome wo) ae a ae = 6 6 we ow ow we es i es i oe ai is ai oa ce we ee Se aie ie ee a ee 


eC LDAULyY S2tGn2 Li NC CFINION Seed iy DECIDEDLY 
AGEE: AGHES DONT KNCW LISAGRi& DISAGREE 


oa ee ee Oe OP ae: = a SO = = CS = a= ae ee ee eee Se = eo 4 ee Se Se ae owe Oe Se ee ee Se a ee ee es EE eS SE om oe Ee Se ce eee Oe Se eo eo =] a= eS 


Meat East fAOY TC NC CEINICN DIFFICULT VERY 
TQ USE US 4 TO US& DI#FICULT 
TO USE 


15 





ewe stevenrce Tecoznition does save time and allows YOU to 
Pema sot wee OLNeEr Tatks, how otten weuld YCU want tc use 11? 


ALWAYS FREQUENTLY NOW AND THEN SELDOM NEVER 


Sa HOwmwWould additional experience with voice recognition 
Pomc menumatieG€ct recognition accuracy? 


Se ow am ee ewe oe es a oe © —- => ae au oe @ @e @ © oe 2 ow © @ oe oe oo eer @eereaw ee @ ewer ee SP oe a= a= == ome Gwe ee ae ee ae oie Se Se ee 


MyCd SOME NO CPINICA ye NC 
IMFROVEMENT IMPROVEMENT IMPROVEMENT IMPRCVEMENT 


1g. How do you feél when a misréeccgnition occrvrs? 


TERCNGLY Pipes NEUTRAL Doon s STRONGLY 
eA Ses 
i1. Eow do vou teel woen & non-recognition (’beep’) cccurs? 

ieee pee NUP a oe ee ern a PER ie a ak Ve PR ee ee oe ! 

\ j { 1 
Sie GLY Linz NEUTRAL Plo blk & STRONGLY 
LIAs Pisk ae 





moaeeeoemaa, you feel waen a reéce@enition occurs? 


or ee oe ae ews Oe © | SS Se Sa ee =~ ame ee ee SS ee ee ae a ee ee oe ome Ge ee Or eee Ce res Oe eee ee —_— oe ae ae ewe eee Cee oe CP ee cee oe oe 


STRONGLY is NZUTRAL Po baron STRONGLY 
pac DP odhiee 


> we ies ee 2 ie Se ee eS ee oS 


SXIREMELY MCDERATE 
GPU Rae OCC FERAT 


mt 


Y COCPERATIVE SOMESEAT eo 
Vi UNCOOPERATIVE UNCOCF- 
ERAT TMi 


Pane COWmm~ouvld you deScride your participating in this tyre 
See ec tel imentleticn: 


=m Ee aw amweamwer G@EPee- a Gee ce = 


STRCNGLY LIKE SN ZUTRAL Pcp STRONGLY 
LIK= DISLIAE 


more Sacra ycu think Of voice Teccznition equipment for 
PS —wloeewehary COniand Centers? 


= ome Se ae ee Oe ae ee oe eee ee 


VERY SCMEWHAT NC QPINICN SCMAWEHAT ce 
PRACTICAL PRACTICAL DONT KiNGW PVPRACTICAL IVESACTICGAL 





APPENDIX C 


SELE-EVALUATION QUESTIONNAIRE 


oo oe rt—isSSs i SUBJECT# | 

DIRECTIONS: aA rumper ot statements which people have used 
to descrite themselves are given below. Read each statement 
and then circle the apprcpriate numcer to the right ot the 
erent That indicetes how you GiNHPALLY feel. There é6re 
ne right or wrong answers. Flease dc rot spend too much 
time on ény One stetement, dut give the answer which seems 


by) GQ@scrite Now you GENSRALLY reel. 


1 = ALMOST NEVER 
ee oeo UNMET IT ss 


= OFTEN 


Cn 
| 


4 = ALMOST aLwATs 


ie eseere pleasant 1 k = aS 

oe leteemer quickiy 1 c c & 

Cee (eeeerm like crylrg HL 2 c - 

4, J] wist I coula be as happy as il < E ~ 
Cumemen seem tc te 

=e leet 52 Be OUT ON things becevse A. 2 2 s 


I can’t wake up my mind soon 
enough 


EL 





1y. 


eu. 


I teel rested 

I am calm, ccol, and ccllected™ 
IW reere thet difficulties are 
Dibwaszeip SC that I cannct 
Overconre ther 


l worry too much over sonetning 
Coewmeuedily dcesn t matter 


tT am hapry 

ivemeernclined tO taze things hard 
Pere Se lit contidencs 

[Dewees secure 


ieee avoid tacing a crisis 
Orecemes icuULTYy 


Peieeer piue 
| Vali eCortent 


DOlgemunircertent thought runs 
bapereafy find and bothers re 


I take disarpeintments sc kKeenly 
(heaters Can t PUL them Out of Fy 
minod 


i am &@ sté€aady person 
ieeeemero 2a State cr tension or 


hwememere aS 1 tChinx cver my recent 
Comeeras and interests 


f~ 


j— 


N) 


AN) NS 


1 


N35 


N) 


Ny 
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Ci 


Cj 


Cx 


C1 


Cx 


C4 


CN 


CA 


cx 
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“y 
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ita 


eae 


re ¢ 
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EVALUATISN 
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Cn 
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Cn 


CN 


CA 
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CA 
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SAFPENUIX LC 


SiLE-H VALUATION QUESTICNNAIRE 


ea lL Le. WP Gees SUB Ghee 

DIRECTIONS: <A number of Statements which people heve used 
EO GQeEScrite themseives are given below. Kead G€ach Stateérent 
ena then circle the eppropriate number to the right of the 
Statememe toat itdicates how you feel RIGHT NOW -- AT THIS 
VERY MOMENT. There are no right Or wrong answers. Pleese 
GO os, Ssj7end tco kuch time on any one statement, Cut give 


Une enswer thet cest descrises your FRESENT feelings. 


eee SA een 


SCVEWHAT 


no 
" 


=FRCWPRALELY SO 


Cn 
! 


4 = VERY MUCH SC 


ieee Loeeeee cair 1. 2 c = 
eo) erecta SCCUre i re C ~ 
oe 1 velmemepensSe 1 z 5 = 
4. 1 am regretful 1 a % ~ 
=e leecelnat Case 1 é 3 ~ 
Sree ieeeeecrs ULSET 7 B S * 
(. I am presently worrying i Z 5 + 


Over possible risfortunes 
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Os 
C3 


I 
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teel restea 

Hoe weanx lous 

Heeie cONLTOrtable 
Mmeeieese Li-C Cnr ident 
feel nervous 

Spey & UETY 

feel high strung. 
é7 reiexea 

feel content 

@r worried 


moe cVer-excited 


Anna tled 
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feel jcyful 


feel pleasant 


Cn 


nN Q) 
CAN Cn CN CN 


do 


ry 


CN 


CN 


CN 


nN 
CN 
if> 
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SCORING KEY 
Or He 
A-STATE EVALULATICN 


1E6 


fs 
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Gx 
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CN 
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CA 
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CN 


CN 
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AFPENDIX & 


UTTHERANCE LIST: TRAINING WEEK - 


UTTERANCE 


THRES 

EUROPS 

Peis [Tl Lab? 
CARRIAGE RETURN 
LOGOUT 

COMMAND 

plant? Cr HORMUZ 
Tees 

KOREA 

Zea 

CHANGE LIRECTORY TO FOOCK 
ALPHA 

POmt IVE 

Doo Trl CATICN 
LAUNCH 

Reo A 1s 

tie A 

eee ORG e CCOMMERDER 
KILO 

LOGIN YELLEN 

ECHO 

NOVEMBER 

TatC 

GrllzD STAIES 
rQUR 

FRAVC 

riggers CIRCLE CN VCSCCW 
BNSMY DeTzCTICN 
FROCKED 

RCWVEC 

igen CON TROLL GR 
SBYEN 

GRCUND CONTACL AEPECACH 
REPORT 

AIHFIELLS NAME 
Teel 

AVAILABLE 

MESSAGE 

SALTELLITE 

SHUG. 

YANEEL 
AFSIRMATIVE 


ey” 


WEEK#L 


Cerner er? 


THREE 
EUROFEL 
MOvrarl LS 
CARR R2TUR 
LOGOUT 
COMMAND 
Seo © aM 
ee 

KCREA 

ZERO 

Ge pere 10 PK 
ALFHA 

Ee Omen ves 

Pi 2 CAT eet 
LAUNUBR 

NORCO ATE 
Dapea 

Perm <2. Gua 
KILO 

LOGIN: Yih 
ECHO 
NOVEMBER 

TO 

UND oes 
Car 

BRAVO 

Pie ol 20 
EN DETSECTICH 
PROG lm. 
ROMEO 
Dee 
SEVEN 

GNE GIL AFPR 
REFORT 

AFLD WAME 
LIVA 
AVAILABLE 
MESSAGE 

See MIKI ed 
SHOOT 

YANKEE 
ATEFIRMATIV:= 


= 
ae 


iM 


“17? 
K vs 
ur 
‘ 





C42 
Gas 
044 
¥4& 
Z4e 
047 
G4E 
C4b 
VES 
Zo L 
Coe 
Wee 
¢54 
Des 
Dee 
ee 
OEE 
C5S 
YER 
Gel 
GEE 
gE? 
ge 
Ges 
YEE 
LEV 
CEE 
ZesS 
OVd 
vil 
GCT2 
as 
G'74 
“7s 
Cre 
O's 
Z7& 
Z?y 
G52 
ve 
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AFPENDIX I 


MASTER LIST OF UTTERANCES 


ONE SYLLABLE UTTERANCES (18) 


Ciuk 
T¥YO 
THREE 
FOUR 
FIVE 
See 
LIGHT 
NINE 
GOL}! 
MIKE 
DCAUNCH 
Aes 
SeU0D 
S Panel 


COURSE 


CN 


J 


~~ 


KURCPE 
LOGOUT 
2 5RO 
SEVEN 
ALPHA 
HRAVO 
CHARLI£ 
DELTA 
>CHO 

PO MERE T 
HOSE 
KILO 
LIMA 
OSCAR 
FOr ee 
CULEEC 
TANGO 
VIGOR 
WH Toe Y 
XRAY 
YANKEE 


el 





CX 


1B 


ZULU 
COMMAND 
REPORT 
OFFENSE 
CELENSE 
ATTACK 
PROCEED 
CANCEL 
MESSAGE 
DISTANCE 
NEUTRAL 
MISSILE 
SENSOR 
REIFUEL 


THREE SYLLABLE UTTERANCES (29) 


OV Malice T 
SOVIxT 

JOINT TASK FCRCE 
NOVEMEER 
JUL 

RCMEO 

SIEREA 

INDIA 

UNI Oh 

KOREA 
NEGATIVE 

POS Tie 
PRECUIEE 
Ria NAME 
ALTIEUEE 
RELOCATE 

LOalt THrt GANN 
LEVEL Two 

Sl eee 
LCRPELC 


FOUWSMOMLELAELE UTTERANCZS (14) 


CARRIAGE HETURN 
LOGIN YeLLeNn 

Si mage cok BORMUZ 
UNITEL STATES 
FLIGHT CCNTRCLIER 
AVAILABLz 
AFFIRMATIVE 

UP IN DaTAIL 


LEZ 





CLOS: CUT CHARLE 
HUMAN FACTCRS 
ALVANTAGES 

LRACK ENEMY 

SEA OF JAFAN 
ACCAT TITLE 


UTTERANCES GREATER THAN OR ECUAL TC & SYLLABLES (16) 


MANEUVER LaLayY 

CRANGE DIRECTORY TO FCOCK 
IDENTIFICATION 

TASK &ORCE CCMIANDER 
PLACE A CIRCLE ON MOSCOW 
GROUND CONTRCL APPROACH 
ENEMY DETECTION 

NORTH ATLANTIC MAP 
MEDITERRANEAN MAP 
PROERETLITY OF DETECTION 
OPERATIONS FLAN 

PACI#HIC DATA BASE 

PLC Teo UrMARINSS 
AUTCVATIC RECOGNITION 
FILa2 TRANSFR2R FRCTOCCL 
NUKE TEEM TILL THEY GLOW 
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INDIVIDUAL SUEJVEULT RECCGNITION RATES 


TUS waegowide are mean error rates fer each subject 
pertici;yating Io. the EXpDErlnrert. The data is 
DattememeedmeGo FITrTOr tne egrcurs establisned ain the 


Overall experimental design and are expressed in percent 
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